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(57) ABSTRACT 

A load balancing system and method for network nodes is 
provided. The load balancing system includes crossbar 
devices, queues to receive data and a load balancer. The load 
balancer determines the amount of data in each of the queues 
and sends data to specific crossbar devices based on the 
amount of data in each queue. The queues include a high 
priority queue and a number of non-high priority queues. 
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AUTOMATIC LOAD BALANCING IN SWITCH 
FABRICS 

CROSS-REFERENCE TO RELATED 
APPLICATIONS 

[0001] This application claims the benefit of U.S. Provi- 
sional Application No. 60/289,557 filed May 7, 2001 which 
is hereby incorporated by reference as if set forth in full 
herein. 

BACKGROUND 

[0002] The present invention relates generally to switch- 
ing devices, and more particularly to load balancing a switch 
system with multiple switching elements, including those in 
which switching elements are dynamically added or 
removed. 

[0003] Conventionally, network communication systems 
include multiple communication or network nodes intercon- 
nected together to provide high speed communication 
throughout the systems. These communication systems have 
become widely pervasive and are rapidly growing. How- 
ever, with this growth, the demand to provide information 
faster without any undue delay is also growing. Likewise, 
the demand to provide larger amounts of information is 
increasing. As such, communication nodes or devices are 
expected to operate quicker in order to provide information 
faster and/or accommodate large amounts of information, 
i.e., support an increased bandwidth. However, at times, 
providing information faster and support a large bandwidth 
are competing demands. 

[0004] Also, in order to meet these demands the ability for 
communication devices to be upgraded, adapted or replaced 
becomes a concern. Often times, communication devices are 
re-configured or replaced and thereby causing down-time, 
i.e., making a particular network inoperable for a period of 
time. Furthermore, the cost of upgrading and maintaining 
communication devices that are larger and faster may be cost 
prohibitive. In addition, communication devices that are 
under utilized become a waste of resources and in some 
cases may obviate the need to expand or replace a commu- 
nication device to provide a particular bandwidth and/or 
speed. 

[0005] However, in providing more information faster 
communication devices are also expected to maintain a 
particular quality of service and reliability. In other words, 
not only does information need to be sent and received, but 
the information should be sent within a specific time frame 
with minimal errors. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0006] FIG. 1 illustrates one embodiment of the load 
balancing system of the present invention; 

[0007] FIG. 2A illustrates one embodiment of the load 
balancer of the present invention; 

[0008] FIG. 2B illustrates one embodiment of a queue; 

[0009] FIG. 3 illustrates an overview of the load balanc- 
ing process of the present invention; 

[0010] FIG. 4 illustrates one embodiment of the load 
balancing process of the present invention; and 



[0011] FIG. 5 illustrates one embodiment of the determi- 
nation of the operating conditions process of the present 
invention. 

SUMMARY OF THE INVENTION 

[0012] The present invention provides a load balancing 
method and system for a switch system with multiple 
switching elements or network nodes. In one embodiment, 
the load balancing system includes a plurality of crossbar 
devices and a plurality of queues. The plurality of queues are 
configured to receive data. The system also includes a load 
balancer coupled to the plurality of queues and configured to 
determine an amount of data in each of the plurality of 
queues and to send the data to specific ones of the plurality 
of crossbar devices based on the amount of data in each 
queue. In one aspect of the invention, the plurality of queues 
includes a high priority queue and a plurality of non-high 
priority queues. Also, in another aspect of the invention, the 
load balancer sends the data to specific crossbar devices of 
the plurality of crossbar devices based on the amount of data 
in the high priority queue relative to the amount of data in 
each of the plurality of non-high priority queues. Further- 
more, in a further aspect of the invention, the load balancer 
is configured to detect inoperable crossbar devices and to 
detect additional crossbar devices added to the plurality of 
crossbar devices. 

[0013] In another embodiment, a load balancing method is 
provided in which a plurality of data is received and stored 
in a plurality of queues. Each data of the plurality of data is 
placed in a specific queue of the plurality of queues based on 
a priority associated with each data. The occupancy levels in 
each of the plurality of queues is determined and the data is 
transmitted to a plurality of crossbar devices based on the 
determined occupancy levels in each queue. 

[0014] In a further embodiment, a load balancing system 
is provided that includes a switching element means. The 
load balancing system also includes a first holding means for 
receiving and storing high priority data and a second holding 
means for receiving and storing non-high priority data. The 
load balancing system further includes a balancing means 
for determining an occupancy level of the first and second 
storing means and sending data to specific switching ele- 
ment means based on the determined occupancy level of the 
first storing means in re la don to the determined occupancy 
level of the second storing means. 

[0015] Many of the attendant features of this invention 
will be more readily appreciated as the same becomes better 
understood by reference to the following detailed descrip- 
tion and considered in conjunction with the accompanying 
drawings. 

DETAILED DESCRIPTION 

[0016] Crossbar devices are specialized communication 
devices that provide communication paths from multiple 
inputs to multiple outputs. Each communication path has a 
predefined capacity that represents the maximum amount of 
information or data that can be accommodated or supported 
by the communication path. Crossbar devices are often 
found in network devices or nodes, such as switches or 
routers, that receive information and reconfigure a commu- 
nication path in order to send the received information to a 
recipient designated by the received information. The 
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present invention, generally, allows for maximum utilization 
of the crossbar devices and the addition of crossbar devices 
without user intervention and down-time. 

[0017] FIG. 1 shows one embodiment of a switch con- 
figuration of the present invention. Multiple data streams are 
routed through a node. The node includes network and/or 
packet processors 11 and, in some embodiments, various 
other specialized processors or circuitry for performing 
various processes associated with transmission, reception, 
and routing of, for example, ATM or SONET traffic. The 
processors process data incoming to and outgoing from the 
node. In one embodiment, the processors perform network 
traffic management functions. Within the node, the proces- 
sors provide data to and receive data from transceivers, with 
the data being provided to the transceivers 13, for example, 
over a common switch interface (CSIX)-L1 compliant link. 

[0018] The transceivers are coupled to multiple crossbar 
devices 15 in parallel. In one embodiment, each of the 
transceivers include a plurality, such as four, i/o ports 
available for coupling to the crossbars. The crossbars route 
data from one transceiver to another. The use of multiple 
crossbar devices allows for increased data throughput, as 
well as for redundancy in the event of failure of a crossbar 
device. 

[0019] In operation, the switch in one embodiment is 
under centralized control of a processor. The processor 
receives routing requests and priority requests from trans- 
ceivers, allocates crossbars, and provides configuration 
commands to the crossbars. In other embodiments, the 
transceivers perform distributed control of the switch. 

[0020] The crossbars and the transceivers may have dif- 
ferent clock domains. In such a configuration various align- 
ment methods may be used, for example such as those 
described in U.S. Pat. Application HIGH SPEED CROSS 
POINT SWITCH ROUTING CIRCUIT WITH WORD- 
SYNCHRONOUS SERIAL BACK PLANE, application 
Ser. No. 09/129,662; Filed Aug. 5, 1998, the disclosure of 
which is incorporated in its entirety herein. 

[0021] The switch includes an automatic load balancing 
function. In one embodiment, a load balancer 17 performs 
the load balancing function. This load balancer can be used 
with 1 to N crossbar switch fabric devices. The load balancer 
determines which crossbar devices are active and evenly 
distributes the data traffic across these devices. This allows 
easy fabric bandwidth upgrades by simply adding more 
crossbar devices. In addition, redundant crossbar devices 
that are normally left unused, can add extra fabric bandwidth 
during normal operation. 

[0022] The load balancer is also configured to automati- 
cally stop sending data to a failed crossbar device. In one 
embodiment the transceivers determine whether a crossbar 
device is operational by monitoring responses to an initial- 
ization command sent to the crossbar via one of the i/o ports. 
If the crossbar responds to the initialization command and is 
able to perform alignment functions, if necessary, the cross- 
bar is available for use. If the crossbar is available for use, 
the crossbar is allocated for data transmission by the load 
balancer. If the crossbar is not available for use, the crossbar 
is not allocated for data transmission by the load balancer. 

[0023] The transceiver is able to determine when a cross- 
bar is no longer available through a variety of mechanisms, 



such as periodic status messages provided by the crossbar. In 
one embodiment, however, the crossbars receive as part of 
the data transmissions periodic word or frame alignment 
information. The crossbars perform, for example, frame 
alignment using the alignment information, and generate an 
out of frame signal (OOF) (or more often a not OOF signal). 
When the transceivers receive an indication that a crossbar 
is not in alignment, the crossbar is treated as unavailable by 
the load balancer. In other embodiments, framing, for 
example, is performed by receiving transcievers, and the 
receiving transceivers provide indications of lack of align- 
ment, as well as an indication of which crossbar is providing 
the out of alignment data. 

[0024] Thus, to upgrade the fabric bandwidth by adding 
more crossbar devices, the system need not be reconfigured 
through software control. Multiple crossbar devices are also 
used for redundancy. Moreover, redundant crossbars may be 
used during normal system operation as load balancing 
results in the use of the remaining active crossbars. 

[0025] In one embodiment, the load balancing function 
works as follows. Ingress queues shown in FIG. 2A can 
have from 0 to N frames ready for transmission to multiple 
crossbar devices, in this case four, coupled to links A, B, C 
or D. In the example of FIG. 2A, the ingress queues receives 
frames from a de-multiplexer 27 which receives a data 
stream. The queues and the links A, B, C, and D are 
associated with a path/device manager 29. As illustrated, the 
ingress queues include a high priority (HP) queue 25 and 
four non-HP queues 23a-d. In one embodiment, the frames 
from the non-high priority queues are provided to the 
path/device manager via a multiplexer 29. A low (L), 
medium (M) or high (H) threshold mark or capacity indi- 
cator is established for each queue. The occupancy level or 
the amount of information, e.g., the number of frames, in 
each queue determines the order in which data is to be 
transmitted across the four crossbar devices as shown, for 
example, in Table 1. 
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[0026] Examination of a simple case when only one queue 
has data is instructive. If the queue holds four or more 
frames, crossbar connections are requested from all four 
crossbar devices. If the queue only holds three frames, 
connections are requested from only three crossbar devices 
etc. If there are less then 4 frames in the queue, the crossbar 
devices are selected using a round robin method. 
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[0027] In most cases, more then one queue will hold data 
frames. Queues, in various embodiments, have different 
priorities and different threshold marks allowing the user to 
tailor the bandwidth from each queue as shown in the table 
above. 

[0028] If one of the crossbar devices has failed or is pulled 
from the system, the load balancer can detect this out of 
frame condition through the links labeled A, B, C and D. If 
one of these devices coupled to these links is out of frame, 
the load balancer will no longer send connection requests or 
data to this crossbar device. This mechanism is automatic 
and requires no user or software intervention. 

[0029] Thus, the switch fabric can be easily upgraded by 
adding more crossbar devices in parallel, avoiding system 
downtime. In addition, the switch fabric does not need to 
identify or dedicate crossbar devices for redundancy. All 
crossbar devices in the fabric can be actively used. 

[0030] FIG. 2B illustrates one embodiment of a queue of 
the present invention. The queue is divided into portions, in 
this case, a first portion 31a, a second portion 31b and a third 
portion 31c. When data is first received it is placed in the 
first portion of the queue. Once the first portion of the queue 
is completely filled the received data is placed in the second 
portion of the queue. Likewise, when the second portion of 
the queue is completely filled with data, the received data is 
placed in the third portion of the queue. In other words, data 
will be placed in a succeeding portion of the queue once the 
previous portion of the queue is filled. In one embodiment, 
corresponding portion indicators 33a, b, c, are associated 
with the first portion, the second portion, and the third 
portion, respectively. The portion indicators indicate that 
data is in one of the three portions. 

[0031] In one embodiment, the portion indicators are bits 
in a portion register. In this embodiment, if, for example, the 
second portion indicator 33ft is high or set to a logic level 
one, this indicates that data is in the second portion of the 
queue. The portion register, in one embodiment, includes 
numerous portion indicators that corresponds to numerous 
portions for each of the queues. By using the portion 
indicators for determining the occupancy level or the 
amount of data or information in each queue, the order in 
which data is transmitted to the cross-bar devices is deter- 
mined. Similar to the determination shown in Table 1, the 
information may be transmitted from a queue to a particular 
device via a link by using the first portion indicator for the 
low capacity indicator, the second portion indicator for the 
medium capacity indicator and the high capacity indicator 
for the third portion indicator. 

[0032] In various embodiments, the queue is also divided 
into more than three portions or less than three portions 
and/or has corresponding portion indicators associated with 
each portion of the queue. Also, the size of each portion, e.g., 
the amount of frames capable of being in each portion, may 
vary and may be modified and/or configured by a user or the 
load balancer. Furthermore, the boundaries of the portions 
are set, determined and/or modified by a user or the load 
balancer, for example, by adjusting the association of the 
portion indicators to the corresponding portion. 

[0033] In FIG. 3, an overview of the load balancing 
function is illustrated. In block 51, the process determines 
which crossbar devices are operational. In block 53, the 



process determines the occupancy level of the high priority 
queue. In block 55, the process determines the occupancy 
level of the non-high priority queue. In block 57, the process 
arranges for the transfer of information from the high 
priority queues and the non-high priority queues to the 
operational crossbar devices based on the occupancy levels 
of the queues and then the process ends. 

[0034] In FIG. 4, one embodiment of the load balancing 
function is illustrated. In block 101, the process determines 
the operational conditions of the crossbar devices. The 
number of crossbar devices that are operational is recorded. 
For example, the total number of crossbar devices may be 
five and the number of operational crossbar devices may be 
three, e.g., crossbar devices 1, 2 and 3. In block 103, the 
process determines the occupancy level of the high priority 
queue. In one embodiment, the process reads an indicator 
register of the high priority queue to determine the occu- 
pancy level of the queue, e.g., high, medium, low or empty. 
In block 105, the process determines if the occupancy level 
of the high priority queue is high. If the process determines 
that the occupancy level of the high priority queue is high, 
the process in block 117 assigns or arranges that the infor- 
mation in the high priority queue is transferred to all the 
operational crossbar devices determined in block 101. 

[0035] If the process determines that the occupancy level 
of the high priority queue is not high, the process in block 
107, determines if the occupancy level of the high priority 
queue is empty. In one embodiment, the process determines 
that the occupancy level is not high, medium or low to 
determine that the queue is empty. If the process determines 
that the high priority queue is empty, the process arranges 
that the information in the non-high priority queues are 
transferred to all the operational crossbar devices in block 
119 and the process ends. 

[0036] If the process determines that the high priority 
queue is empty, the process in block 109 determines the 
occupancy level of the other non-high priority queues. In 
one embodiment, the process accesses/reads an indicator 
register of the high priority queue to determine the occu- 
pancy level of the queue, e.g., high, medium, low or empty. 
In block 111, the process determines if the other queues are 
empty. If the process determines that the other queues are 
empty, the process arranges to transfer information from the 
high priority queue to all the operational crossbar devices, in 
block 117, and the process ends. In one embodiment, the 
process does not end, but continues, repeating over again at 
block 101 until commanded externally to end, such as by a 
shutdown command provided by a user or a control hard- 
ware/software. 

[0037] If the process, in block 111, determines that the 
other queues are not empty, the process in block 113 
arranges to transfer information from the high priority queue 
to some of the operational crossbar devices, e.g., crossbar 
devices A and B. The process in block 115 arranges to 
transfer information from the other non-high priority queues 
to the other remaining operational crossbar devices, e.g., 
crossbar device C, and then the process ends. In one embodi- 
ment, the process in blocks 113 and 115 determines the 
arrangement of the transfer of information from the high 
priority and non-high priority queues to the operational 
crossbar devices based on the following: 

[0038] When the high priority queue has an occupancy 
level of medium or low, for each occupancy level (from high 
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to low) of the non-high priority queues the process allocates 
one less crossbar devices. For example, if the occupancy 
level is high for the non-high priority queues, the process 
allocates four crossbar devices, for a medium occupancy 
level, three crossbar devices, and for a low occupancy level, 
two crossbar devices. The occupancy level of the high 
priority queue determines the starting point or total number 
of crossbar devices that the process allocates to the non-high 
priority queues. Using the above example, if the occupancy 
level of the high priority queue is low, the total number of 
crossbar devices is four and for a medium occupancy level, 
the total number of crossbar devices is three. If the total 
number of operational crossbar devices is four, then in the 
above example, the information in the high priority queue is 
not transferred to a crossbar device until the occupancy level 
of the non-high priority queues reduces from high to 
medium. 

[0039] In one embodiment, redundancy is employed by 
utilizing an additional spare or a complete set of spare links 
or crossbar devices (e.g., for each available link or crossbar 
device another link or crossbar device acting as a backup is 
provided) dedicated to handle the transfer of information if 
a link or crossbar device fails. As such, the switch-over from 
one link or crossbar device that failed to the additional 
(redundant) link or crossbar device is performed quickly to 
cause minimal or no down-time by the switch. In another 
embodiment, redundancy is employed utilizing all the avail- 
able links or crossbar devices that are operational. If a link 
or crossbar device fails, information transfer continues using 
the number of available links or crossbar devices, however, 
the number of available links or crossbar devices is now 
reduced by one, i.e., minus the failed link or device. Thus, 
the transfer of information is maintained and thereby caus- 
ing minimal or no down-time to the switch. 

[0040] Similarly, in one embodiment, scaleability is 
employed by again utilizing all the available links or cross- 
bar devices that are operational. If a link or crossbar device 
is added, information transfer continues using the number of 
available links or crossbar devices, however, the number of 
available links or crossbar devices is now increased by one 
or more, i.e., adding the new links or devices. Hence, 
transfer of information is maintained and thus causing 
minimal or no down-time to the switch. 

[0041] FIG. 5 illustrates an embodiment of the process of 
determining the operating conditions of the available cross- 
bar devices or links. In block 201, the process transmits 
predetermined data, frames or packets, to each crossbar 
device or link. In one embodiment, the predetermined data 
comprise a status command, framing packets and/or infor- 
mation to be transmitted on the device or link. In one 
embodiment, the load balancer sends the data to the devices 
or links. In block 203, the process detects any operational 
responses from the crossbar devices or links in response to 
the predetermined data. In one embodiment, the process 
waits for a predetermined response such as an out of frame 
error, a status response or an error message, to indicate that 
a particular device or link or a set of devices or links are not 
operating or available to receive information. In another 
embodiment, the process waits for a specific period of time 
and if no response is received the process determines that an 
operational response has occurred. 

[0042] If, in block 203, the process detects an operational 
response, the process updates an operational list of crossbar 



devices in block 205. In one embodiment, the operational 
list comprises a list of available crossbar devices or available 
links and the process updates the operational list by remov- 
ing the crossbar device or link from the list when an 
operational response, e.g., an out of frame condition, is 
detected in block 203. The operational list, in one embodi- 
ment, is utilized by the load balancer or process both 
previously described to determine the operational conditions 
of the devices or links for the transmission of information. 
In one embodiment, the process updates the list by disabling 
a device or link when a non -operational link or device is 
detected by, for example, marking or otherwise indicating in 
the list that the device is not operational. Also, in one 
embodiment, the mark or indication is recognized by the 
load balancer or the process to determine the operational 
conditions of the devices or links. If the process does not 
detect an operational response, the process continues to 
block 207. 

[0043] In block 207, the process determines if additional 
devices or links are available. In one embodiment, the 
process determines if additional devices or links are avail- 
able by transmitting predetermined data to a location where 
an additional device or link might be. In various embodi- 
ments, the location where an additional device or link is 
expected to be is a predefined offset from the last operational 
crossbar device or link detected, a specific address location 
or shared memory- space where all the devices and links are 
provided a specific address/memory space or location, or an 
additional entry in the operational list. If, in block 207, the 
process does not detect an operational response in response 
to the transmitted data, the process updates the operational 
list in block 209. The process then ends. In one embodiment, 
the process adds the additional link or device located to the 
operational list. If, in block 207, the process does detect an 
operational condition indicating that a new or additional 
device or link is not operating, e.g., provides an out of frame 
error, then the process ends. 

[0044] In one embodiment, operations performed by the 
process in blocks 207-209 are combined with blocks 201- 
205. For instance, the process in block 201 transmits pre- 
determined data to each device or link and additionally to a 
location where an additional device or link might be. In 
block 203, the process detects an operational response, e.g., 
an out of frame condition, in response to the transmitted data 
from each of the devices or links and any possible additional 
new devices or links. In block 205, the process updates the 
operational list, e.g., removing or disabling devices or links 
from the list that are not operational, e.g., having an out of 
frame condition, and/or adding devices or links to the list 
that are new and operational. In one embodiment, the 
process continually repeats at predetermined intervals, such 
as once a day or once a week, or at specific predetermined 
times, such as at initialization or a scheduled maintenance. 
In another embodiment, the process does not send or trans- 
mit predetermined packets, but waits to be interrupted or 
polls for a response from the devices or links as to the 
operational condition of each device or link. 

[0045] In one embodiment, the operational response indi- 
cates that a particular device or link or a set of devices or 
links are operational or available to receive information. 
Thus, in this embodiment, in blocks 205 and 209, if the 
process detects an operational condition, the process, in one 
embodiment, updates the operational list by adding the 
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device or link that provided the response or the device or 
link indicated as being operational by the response. Like- 
wise, if the process does not detect an operational condition, 
the process removes or disables the device or link in the 
operational list. Also, in one embodiment, the process in 
block 205 may be configured to detect an operational 
condition that indicates that a particular device or link or set 
of devices or links are operational or available to receive 
information and in block 209, the process may be configured 
to detect an operational condition that indicates that a 
particular device or link or set of devices or links are not 
operational or not available to receive information or vice 
versa. 

[0046] Accordingly, the present invention provides an 
automatic load balancer. Although the invention has been 
described in certain specific embodiments, it should be 
recognized that those of skill in the art would recognize 
other insubstantially different ways to implement the present 
invention. Thus, the present embodiments should be con- 
sidered exemplary only, with the invention defined by the 
claims and their equivalents supported by this disclosure. 

What is claimed is: 

1. A load balancing system for network nodes, the load 
balancing system comprising: 

a plurality of crossbar devices; 

a plurality of queues configured to receive data; and 

a load balancer coupled to the plurality of queues and 
configured to determine an amount of data in each of 
the plurality of queues and to send the data to specific 
ones of the plurality of crossbar devices based on the 
amount of data in each queue. 

2. The load balancing system of claim 1 wherein the 
plurality of queues comprises a high priority queue and a 
plurality of non-high priority queues. 

3. The load balancing system of claim 2 wherein the load 
balancer sends data to specific crossbar devices of the 
plurality of crossbar devices based on an amount of data in 
the high priority queue relative to an amount of data in each 
of the plurality of non-high priority queues. 

4. The load balancing system of claim 2 wherein the load 
balancer sends data to specific crossbar devices in a order 
that is based on one of the amount of data in the high priority 
queue and an amount of data in each of the plurality of 
non-high priority queues. 

5. The load balancing system of claim 2 wherein the load 
balancer sends data to specific crossbar devices of the 
plurality of crossbar devices based on an amount of data in 
each of the plurality of non-high priority queues relative to 
an amount of data in the high priority queue. 

6. The load balancing system of claim 2 wherein the load 
balancer sends data to specific crossbar devices in a order 
based on one of an amount of data in the high priority queue 
and an amount of data in each of the plurality of non-high 
priority queues. 

7. The load balancing system of claim 1 further compris- 
ing a capacity indicator identifying the amount of data in 
each queue. 

8. The load balancing system of claim 7 wherein the load 
balancer is configured to determine the amount data in each 
queue based on examining the capacity indicator and to 



transmit data to the plurality of crossbar devices in a 
predetermined order based on the examination of the capac- 
ity indicator. 

9. The load balancing system of claim 7 wherein the 
capacity indicator further indicates an occupancy level based 
on the amount of data in each queue. 

10. The load balancing system of claim 9 wherein load 
balancer is configured to transmit data to the plurality of 
crossbar devices in a predetermined order based on various 
occupancy levels of each queue as indicated by the capacity 
indicator. 

11. The load balancing system of claim 10 wherein the 
occupancy levels are high, medium, low and empty. 

12. The load balancing system of claim 1 wherein each of 
the queues are divided into a plurality of portions having a 
corresponding portion indicator for each portion of the 
queues to identify that data are in a corresponding portion of 
a queue. 

13. The load balancing system of claim 12 wherein the 
portion indicators are modifiable to indicate various occu- 
pancy levels in the queue. 

14. The load balancing system of claim 12 wherein the 
load balancer is configured to transmit data from the plu- 
rality of queues to the plurality of crossbar devices in a 
predetermined order based on the portion indicators. 

15. The load balancing system of claim 2 wherein each of 
the queues are divided into a first portion, a second portion 
and a third portion. 

16. The load balancing system of claim 15 wherein the 
data received are placed first in the first portion, when the 
first portion is full, the received data are placed in the second 
portion and, when the second portion is full, the received 
data are placed the third portion of the queue. 

17. The load balancing system of claim 15 wherein the 
load balancer, upon determining that data are in the third 
portion of the queue of the high priority queue, causes the 
data in the high priority queue to be transmitted to all the 
plurality of crossbar devices that are available. 

18. The load balancing system of claim 15 wherein the 
load balancer, upon determining that data are in the first 
portion of the queue of the high priority queue, causes the 
data in the third portion of the non-high priority queues to be 
transmitted to all the plurality of crossbar devices that are 
available. 

19. The load balancing system of claim 15 wherein the 
load balancer, upon determining that data are only in the 
non-high priority queues, causes the data in the non-high 
priority queues to be transmitted to all the plurality of 
crossbar devices that are available. 

20. The load balancing system of claim 15 wherein the 
load balancer, upon determining that data are only in the 
high priority queues, causes the data in the high priority 
queues to be transmitted to all the plurality of crossbar 
devices that are available. 

21. The load balancing system of claim 15 wherein the 
load balancer, upon determining that data are in one of the 
first and second portions of the queue of the high priority 
queue, causes the data in the non-high priority queues to be 
transmitted to particular predetermined crossbar devices that 
are available and causes the data in the high priority queue 
to be transmitted to remaining crossbar devices from the 
plurality of crossbar devices that are available. 
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22. The load balancing system of claim 15 further com- 
prising: 

a first indicator identifying that data are in the first portion 
of a queue; 

a second indicator identifying that data are in the second 
portion of a queue; and 

a third indicator identifies that data are in the third portion 
of a queue. 

23. The load balancing system of claim 15 wherein the 
load balancer, upon determining that the third indicator 
identifies that data are in the third portion of the queue of the 
high priority queue, causes the data in the non-high priority 
queues to be transmitted to all the plurality of crossbar 
devices that are available. 

24. The load balancing system of claim 1 wherein the load 
balancer is configured to detect inoperable crossbar devices. 

25. The load balancing system of claim 24 wherein the 
load balancer in detecting inoperable devices comprises 
sending a message to the plurality of crossbar devices and 
receiving a response sent from each of the plurality of 
crossbar devices that are operating. 

26. The load balancing system of claim 24 wherein the 
load balancer in detecting inoperable devices comprises 
sending a message to the plurality of crossbar devices and 
determining if a response sent from each of the plurality of 
crossbar devices that are operating based on a predetermined 
time frame. 

27. The load balancing system of claim 24 wherein the 
load balancer is configured to detect additional crossbar 
devices added to the plurality of crossbar devices. 

28. The load balancing system of claim 27 wherein the 
load balancer in detecting additional crossbar devices com- 
prises sending a data to at least one predetermined location 
and receiving a response sent from each of the additional 
crossbar devices that are added. 

29. The load balancing system of claim 27 wherein the 
load balancer in detecting additional crossbar devices com- 
prises receiving a data sent from each of the additional 
crossbar devices that are added. 

30. The load balancing system of claim 25 wherein the 
predetermined location is an offset from one of the plurality 
of crossbar devices. 

31. The load balancing system of claim 1 further com- 
prising a processor coupled to the load balancer. 

32. A load balancing method comprising: 

receiving a plurality of data; 

storing the plurality of data in a plurality of queues, each 
data of the plurality of data being placed in a specific 
queue of the plurality of queues based on a priority 
associated with each data; 

determining occupancy levels in each of the plurality of 
queues; and 

transmitting the data to a plurality of crossbar devices 
based on the determined occupancy levels in each 
queue. 



33. The load balancing method of claim 32 wherein the 
plurality of queues comprises a high priority queue and a 
plurality of non-high priority queues. 

34. The load balancing method of claim 33 wherein the 
occupancy levels in each queue are based on an amount of 
data in the high priority queue and an amount of data in each 
of the plurality of non-high priority queues. 

35. The load balancing method of claim 33 wherein 
transmitting the data, the data are transmitted to specific 
crossbar devices in an order that is based on an occupancy 
level of the high priority queue and an occupancy level in 
each of the plurality of non-high priority queues. 

36. The load balancing method of claim 32 further com- 
prising dividing the queues into a plurality of portions 
having a corresponding portion indicator for each portion of 
the queues to identify that data are in a corresponding 
portion of a queue. 

37. The load balancing method of claim 32 further com- 
prising modifying the portion indicators to indicate various 
occupancy levels in the queue. 

38. The load balancing method of claim 32 wherein 
transmitting the data, the data are transmitted to specific 
crossbar devices in an order based on the portion indicators. 

39. The load balancing method of claim 33 further com- 
prising: 

dividing each of the queues into a first portion, a second 
portion and a third portion; and 

determining if data are in the first, second and third 
portions of the plurality of queues. 

40. The load balancing method of claim 39 wherein the 
transmitting of the data further comprises transmitting the 
data in the high priority queue to all the plurality of crossbar 
devices, upon determining that data are in the third portion 
of the queue of the high priority queue. 

41. The load balancing method of claim 39 wherein the 
transmitting of the data further comprises transmitting the 
data in the third portion of the non-high priority queues to all 
the plurality of crossbar devices, upon determining that data 
are in the first portion of the queue of the high priority queue. 

42. The load balancing method of claim 39 wherein the 
transmitting of the data further comprises transmitting the 
data in the non-high priority queues to all the plurality of 
crossbar devices, upon determining that data are only in the 
non-high priority queues. 

43. The load balancing method of claim 39 wherein the 
transmitting of the data further comprises transmitting the 
data in the high priority queues to all the plurality of crossbar 
devices, upon determining that data are only in the high 
priority queues. 

44. The load balancing method of claim 39 wherein the 
transmitting of the data further comprises transmitting the 
data in the non-high priority queues to particular predeter- 
mined crossbar devices and transmitting the data in the high 
priority queue to remaining crossbar devices from the plu- 
rality of crossbar devices, upon determining that data are in 
one of the first and second portions of the queue of the high 
priority queue. 
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45. The load balancing method of claim 32 further com- 
prising detecting one of inoperable crossbar devices and 
operable crossbar devices and transmitting data to the cross- 
bar devices based on the detecting of one of inoperable 
crossbar devices and operable crossbar devices. 

46. The load balancing method of claim 32 further com- 
prising detecting an operational condition of each of the 
plurality of crossbar devices and transmitting data to the 
crossbar devices based on the operational condition 
detected. 

47. The load balancing method of claim 32 further com- 
prising detecting additional crossbar devices added to the 
plurality of crossbar devices and transmitting the data to the 
detected additional crossbar devices. 



48. A load balancing system comprising: 
switching element means; 

first holding means for receiving and storing high priority 
data; 

second holding means for receiving and storing non-high 
priority data; and 

balancing means for determining an occupancy level of 
the first and second storing means and sending data to 
specific switching element means based on the deter- 
mined occupancy level of the first storing means in 
relation to the determined occupancy level of the 
second storing means. 

* * * * * 
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(57) ABSTRACT 

A network routing switch includes a crosspoint switch, a set 
of Ethernet I/O ports, a set of ATM I/O ports, a reassembly 
unit for converting ATM transmissions into Ethernet 
transmissions, and a segmentation unit for converting Eth- 
ernet transmissions into ATM transmissions. As an ATM I/O 
port receives cells of an ATM transmission from an external 
source it stores them until the transmission is complete and 
then sends the ATM transmission through the crosspoint 
switch either to a forwarding ATM port or to the reassembly 
unit depending on whether the ATM transmission is to be 
forwarded as an ATM or Ethernet transmission. When the 
reassembly unit receives an ATM transmission it converts it 
to an Ethernet transmission and forwards it through the 
crosspoint switch to a forwarding Ethernet I/O port. When 
an Ethernet I/O port receives an Ethernet packet from an 
external source, it sends the packet through the crosspoint 
switch either to a forwarding Ethernet port or to the seg- 
mentation unit, depending on whether the Ethernet packet is 
to be forwarded as an Ethernet or ATM transmission. When 
the segmentation unit receives an Ethernet packet it converts 
it to an ATM transmission and forwards it through the 
crosspoint switch to a forwarding ATM I/O port. 
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1 2 

NETWORK SWITCH PROVIDING PER Ethernet protocol, a reassembly unit must convert each ATM 

VIRTUAL CHANNEL QUEUING FOR transmission to Ethernet protocol before the transmission 

SEGMENTATION AND REASSEMBLY arrives at the switch. Conversely a segmentation unit must 

BACKGROUND OF THE INVENTION convert an Ethernet packet emerging from the switch to 

v m f th i 5 ^™ protocol before it can be forwarded via a high speed 

Zl invention . ATM channel. Thus segmentation and reassembly units are 

Tlie present invention relates m general to a network f ircd at each Elhemet switcfa that {s tQ 

routing switch for routing both ATM and Ethernet protocol y ^ pro tocol 

data transmissions between stations of a computer network, p 

and in particular to a routing switch that translates between A segmentation unit is relatively simple and requires 

the two protocols. 10 relatively little storage capacity because it need only process 

2. Description of Related Art one Ethernet packet at a time. However since cells of many 

Ethernet and asynchronous transfer mode (ATM) proto- hundreds) of different ATM transmissions may 

cols are widely used to convey data between network amve at a ^assembly unit interleaved, a reassembly unit 

stations. A station in an Ethernet network sends data to other „ ^ res much lo 8 lc a°°memory to sort out the interleaved 

stations in the form of a variable length Ethernet packet 15 ^ transm^stons and to store the cells of each ATM 

including, in addition to a payload of 46 to 1500 bytes of transmission untd they can be assembled mto an Ethernet 

data, fields indicating its network source and destination P acket - ^ each rea ^ bly " mt ?? mKS icc T t0 a ,ar * e 

addresses and other information. An Ethernet packet must be memo 7 for st0 " n S ATM cells of large numbers of data 

sent and received as a continuous transmission and cannot , n tran f m f f°* » d ako needs lo S> c f ° r kee P ln S 

be interleaved with other transmissions. 20 track ° f « he . ArM transmlsslons tha ! lt lsstorm f Due ^ the 

» » r , j . , . comphcated nature of the reassembly unit and its need for a 

A station m an ATM network segments a data transmis- 1*1 \ / i_i 

. . ... . - AQ u ° ™ . c . A _ . large, external memory, a segmentation/reassembly umt 

sion into a variable number of 48 -bit ATM cells. Each ATM 0 , . ... . . ' , j~ 4 • u • « 1 

..... /. « « j , , , capable of handling high speed data traffic is physically large 

cell includes a fixed length data payload, a virtual path j • o- « * *• < 

■j 7c ^™ • i t. 1 . . i.'c ,xrr>i\ a and requires numerous I/O pins. Smce such a segmentation/ 

identifier (VPQ a virtual channel identifiei <yCT) and other 25 reassemb , ^ ta J tded for each of sev s efal switcfa 

mformation. All ATM cells formmg an ATM transmission ' swi ( ch f AIM/Btomet translation 

have the same VCI and VPI identifiers. The VPI field f j^* g and ex ng . ve r 

identifies a particular virtual path that the cells of the s ™ 

transmission are to follow through the network. Each virtual ^ since ^ routlQ g swltch operates only m one 

path may have several virtual channels. The VCI field 30 protocol, an ATM packet destined for an ATM system must 

identifies the particular virtual channel within a virtual path first be translated into Ethernet protocol, sent through the 

to which a transmission is assigned. When two ATM trans- Ethernet protocol routing switch, and then converted back 

missions are assigned to the same virtual channel of the ^ ArM protocol. The double translation of ATM packets 

same virtual path, one transmission must follow the other; sim P ] Y t0 move lhem throu S h an Ethcrnet P rotoco1 switch 

their cells may not be interleaved. An end of message 35 reduces s y stem throughput. 

(EOM) code included in the last cell of each transmission What is needed is a network switch that routes both ATM 

marks the boundary between that transmission and the next and Ethernet protocol transmissions, that provides segmen- 

transmission of the same virtual path and channel. Cells of tation and reassembly only when necessary, and which does 

several ATM transmissions assigned to the same virtual path not employ a large number of complicated segmentation/ 

may be interleaved on the same physical transmission fine at ^ reassembly units. 

any point along that path provided they are assigned to summary OF THF INVFNTIOM 

different virtual channels. The VPI field in the cells allows SUMMARY OF THE INVENTION 

a receiving network device to sort them by transmission. a network routing switch in accordance with the present 

In one respect Ethernet packets are more efficient than invention includes a crosspoint switch, a set of Ethernet I/O 
ATM transmissions because an Ethernet packet can contain 45 ports, a set of ATM I/O ports, a reassembly unit for con- 
more payload data per unit of routing information than an verting ATM transmissions into Ethernet transmissions, and 
ATM cell. However in networks where stations must com- a segmentation unit for converting Ethernet transmissions 
pete for high speed transmission channels, the ability to into ATM transmissions. As an ATM I/O port receives cells 
interleaved ATM cells allows more efficient use of data of an ATM transmission from an external source it stores 
transmission bandwidth. 50 them until the transmission is complete and then sends the 

It is sometimes advantageous for a network to employ ATM transmission through the crosspoint switch either to a 

both ATM and Ethernet protocols. For example a small forwarding ATM port or to the reassembly unit depending on 

cluster of closely associated network stations interconnected whether the ATM transmission is to be forwarded as an ATM 

by a network routing switch may communicate with one or Ethernet transmission. When the reassembly unit receives 

another via Ethernet protocol. However when routing 55 an ATM transmission it converts it to an Ethernet transmis- 

switches interconnecting such clusters of network stations sion and forwards it through the crosspoint switch to a 

are themselves interconnected by high speed channels, it forwarding Ethernet I/O port. When an Ethernet I/O port 

may be advantageous to employ ATM protocol when send- receives an Ethernet packet from an external source it sends 

ing data over those high speed channels. In such systems the packet through the crosspoint switch either to a forward- 

"segmentation" and "reassembly" units are used to convert 60 m g Ethernet port or to the segmentation unit depending on 

between Ethernet and ATM transmission protocols. A seg- whether the Ethernet packet is to be forwarded as an 

mentation unit segments a large Ethernet packet into a Ethernet or ATM transmission. When the segmentation unit 

sequence of smaller ATM cells. A reassembly unit "reas- receives an Ethernet packet it converts it to an ATM trans- 

sembles" a transmission of small ATM cells into one or more mission and forwards it through the crosspoint switch to a 

large Ethernet packets. 65 forwarding ATM I/O port. 

Prior art routing switches handle only one transmission It is accordingly an object of the invention to provide a 

protocol. Thus, for example, when a network switch uses network switch that can route both ATM and Ethernet 
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packets and which translates between ATM and Ethernet transmission to a local Ethernet station 14, a local ATM 

protocols only when necessary. station 16, or to another switch 12 via an Ethernet path 18 

It is another object of the invention to provide a network or an ATM path 20. If the Ethernet packet is to be forwarded 

switch that can translate between ATM and Ethernet packets t0 a l° ca l Ethernet station 14 or to a remote switch 12 via an 

using only a single segmentation unit and a single reassem- 5 Ethernet path 18, the receiving switch 12 forwards the 

bly unit, rather than a segmentation and reassembly unit at Ethernet packet without translation. However if the Ethernet 

each of several I/O ports. packet is to be forwarded to a local ATM station 16 or to a 

Hie concluding portion of this specification particularly * m ? te 12 v [ a an *™ P ath 20 > ^ receiving switch 

points out and distinctly claims the subject matter of the translates the Ethernet protocol transmission to an 

present invention. However those skilled in the art will best 10 ATM, protocol transmission before forwarding it to its des- 

understandboththeorganizationandmethodofoperationof tmatl0n - Conversely when a switch 12 receives an ATM 

the invention, together with further advantages and objects transmission it translates the ATM transmission to one or 

thereof, by reading the remaining portions of the specifica- mor f Ethernet packets only when it is to be forwarded to a 

tion in view of the accompanying drawing(s) wherein like ^ Ethernet station 14 or to a remote switch 12 via an 

reference characters refer to like elements. 15 Elneraet P ath 18 - 



BRIEF DESCRIPTION OF THE DRAWING(S) 

FIG. 1 illustrates a computer network employing a set of 
network routing switches in accordance with the present 20 
invention; 

FIG. 2 illustrates one of the routing switches of FIG. 1 in 
more detailed block diagram form; 

FIG. 3 illustrates a typical routing switch of FIGS. 1 and 
2 in more detailed block diagram form; 25 

FIG. 4A illustrates the structure of an ATM cell; 

FIG. 4B illustrates the structure of an Ethernet packet; 

FIG. 5 illustrates an ATM input port of FIG. 3 in more 
detailed block diagram form; 30 

FIG. 6 illustrates the data structures maintained by the 
input port of FIG. 5; 

FIG. 7 illustrates the ATM output port of FIG. 3 in more 
detailed block diagram form; 

FIG. 8 illustrates the ATM output port switch interface 35 
circuit of FIG. 5 and the ATM input port switch interface 
circuit of FIG. 7 in more detailed block diagram form; 

FIG. 9 illustrates the Ethernet input port of FIG. 3 in more 
detailed block diagram form; ^ 

FIG. 10 illustrates the Ethernet output port of FIG. 3 in 
more detailed block diagram form; 

FIG. 11 illustrates the input port switch interface of FIG. 
9 and the output port switch interface of FIG. 10 in more 
detailed block diagram form; 45 

FIG. 12 illustrates the arbiter of FIG. 3 in more detailed 
block diagram form; 

FIG. 13 illustrates the reassembly unit of FIG. 2 in more 
detailed block diagram form; and ^ 

FIG. 14 illustrates the segmentation unit of FIG. 2 in more 
detailed block diagram form. 

DESCRIPTION OF THE PREFERRED 
EMBODIMENTS) 

FIG. 1 illustrates a computer network 10 employing a set 
of network routing switches 12 in accordance with the 
present invention. Switches 12 interconnect network stations 
14 employing Ethernet protocol and networks stations 16 
employing ATM protocol and provide the protocol transla- 60 
tion when necessary to allow stations 14 and 16 to commu- 
nicate with each other. Routing switches 12 are themselves 
linked either by high speed Ethernet communication paths 
18 or ATM communication paths 20. 

When an Ethernet station 14 transmits an Ethernet packet 65 
to a switch 12, the switch determines from routing data 
included in the transmission whether it must forward the 



Routing Switch Architecture 

FIG. 2 illustrates one of the routing switches 12 of FIG. 
1 in simplified block diagram form. Routing switch 12 
includes a crosspoint switch 20 interconnecting a set of 
Ethernet I/O ports 24, and a set of ATM I/O ports 26. 
Routing switch 12 also includes a segmentation unit 32 for 
converting Ethernet packets into ATM transmissions and a 
reassembly unit 34 for converting ATM transmissions into 
Ethernet packets. Crosspoint switch 20 provides communi- 
cation paths between I/O ports 24 and 26 and segmentation 
and reassembly units 32,34 in response to control data from 
a routing arbiter 28. 

When an ATM port 26 receives an incoming ATM trans- 
mission via an ATM bus, it stores each cell of the transmis- 
sion in an internal memory. Although cells of separate ATM 
transmissions may arrive interleaved at an ATM port 26, the 
port looks at data included in each incoming cell to deter- 
mine the transmission to which it belongs and stores the 
ATM cell accordingly. The ATM port 26 also determines 
from information included in the ATM cells of each trans- 
mission how the transmission is to be forwarded through 
switch 20. If the ATM transmission is to be forwarded 
through another ATM port 26, the receiving ATM port 26 
requests arbitrator 28 to route the transmission directly to the 
forwarding ATM port 26. When the forwarding ATM port 26 
is idle (not busy forwarding another transmission), arbitrator 
28 establishes a path between the receiving and forwarding 
ATM ports 26 through switch 20 and signals the receiving 
ATM port 26 to begin forwarding the ATM transmission to 
the forwarding ATM port 26. 

On the other hand, if an ATM port 26 determines that an 
incoming ATM transmission is to be forwarded outward 
from switch 12 in the form of one or more Ethernet packets, 
the receiving ATM port 26 requests arbitrator 28 to route the 
ATM transmission to reassembly unit 34. When the reas- 
sembly unit 34 is ready to receive the ATM transmission, 
arbitrator 28 establishes a path through switch 20 between 
the receiving ATM port 26 and reassembly unit 34 and 
signals the receiving ATM port to begin forwarding the ATM 
transmission to the reassembly unit. Reassembly unit 34 
then converts the ATM transmission to one or more Ethernet 
packets, determines which Ethernet port 24 is to forward the 
Ethernet packet(s), and then requests arbitrator 28 to route 
the Ethernet packet(s) through switch 20 to the forwarding 
Ethernet port 24. 

When an Ethernet port 24 receives an incoming Ethernet 
packet it stores the packet and determines whether the data 
conveyed in the packet is to be forwarded outward from 
switch IL via another Ethernet port 24 via an ATM port 26. 
When the packet is to be forwarded by an Ethernet port, the 
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receiving Ethernet port 24 requests arbitrator 28 to establish 
a path through switch 20 to the forwarding Ethernet port 24, 
and forwards the Ethernet packet when the path is estab- 
lished. When an Ethernet port 24 receives an Ethernet packet 
that is to be forwarded as an ATM transmission, the Ethernet 
port 24 requests arbitrator 28 to route the Ethernet packet 
through switch 20 to segmentation unit 32. Segmentation 
unit 32 then converts the Ethernet packet to an ATM 
transmission, determines which ATM port 26 is to forward 
the ATM transmission, and requests arbitrator 28 to route the 
ATM transmission to the forwarding ATM port 26. 

FIG. 3 illustrates routing switch 12 of FIGS. 1 and 2 in 
more detailed block diagram form. Switch 12 includes N 
ATM I/O ports and M Ethernet I/O ports. Each ATM I/O port 
is shown in FIG. 3 as including two components: one of 
ATM input ports PRA(0)-PRA(N) and a corresponding one 
ATM output ports PT>^0)-PTA(N). Each of the M Ethernet 
I/O ports is also shown as including two components: one of 
Ethernet input ports PRE(0)-PRE(M) and a corresponding 
one of Ethernet output ports PTE(0)-PTE(M). Incoming 
ATM transmissions arrive at input ports PRA(0)-PRA(N) 
via corresponding transmission lines RA(0)-RA(N) and 
incoming Ethernet packets arrive at input ports PRE(0)-PRE 
(M) via corresponding transmission lines RE(0)-RE(M). 
Outgoing ATM transmissions depart from output ports PTA 
(0)-PTA(N) via corresponding transmission lines TA(0)-TA 
(N) and outgoing Ethernet packets depart from output ports 
PTE(0)-PTE(M) via corresponding transmission lines 
TE(0)-TE(M). Segmentation unit 32 includes three compo- 
nents: an input interface 32A, an output interface 32B, and 
a processing section (not shown in FIG. 3) connected 
between its input and output interfaces. Similarly reassem- 
bly unit 34 also includes three components: an input inter- 
face 34A and an output interface 34B and a processing 
section (not shown in FIG. 3) connected between its input 
and output interfaces. 

Crosspoint switch 20 includes a set of "vertical" conduc- 
tors 45 connected to input ports PRA(0)-PRA(N) and PRE 
(0)-PRE(N) and to segmentation and reassembly unit output 
interfaces 32B and 34B. Switch 20 also includes a set of 
"horizontal" conductors 47 connected to input ports PRA 
(0)-PRA(N) and PRE(0)-PRE(N) and to the segmentation 
and reassembly unit input interfaces 32A and 34A. When 
turned on, a pass transistor 48 at each crosspoint between a 
horizontal conductor 47 and a vertical conductor 45 forms a 
signal path between the two conductors. Each bit of an 
M+N+2 bit data word at each address of a random access 
memory (RAM) 42 controls the on/off state of a separate one 
of the M+N+2 transistors 48 along one horizontal row of 
switch 20. To connect a particular input port to a particular 
output port, arbiter 28 writes one data word to RAM 42. That 
data word turns on one pass transistor 48 and turns off all 
others along the horizontal line 47 leading to the output port. 

Arbitrator 28 communicates with all ATM input and 
output ports PRA(0)-PRA(N) and PTA(0)-PTA(N), the 
reassembly unit input interface 34A and the segmentation 
unit output interface 32B through a parallel arbitration bus 
ARB_A. Using the ARB^A bus, arbitrator 28 polls the 
ATM input and output ports to determine when and where an 
input port wants to send an ATM transmission and to 
determine whether an ATM output port is ready to receive 
and forward an ATM transmission. Arbiter 28 also uses the 
ARB_A bus to poll segmentation unit output interface 32B 
to determine when and where it wants to send an ATM 
transmission it has created and to poll reassembly unit input 
interface 34A to determine when it is ready to receive and 
reassemble an ATM transmission. The information obtained 
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tells arbiter 28 how and when to route ATM transmissions 
through switch 20. 

In a similar manner arbitrator 28 uses another parallel 
arbitration bus ARB_E to poll all Ethernet input and output 
5 ports PRE(0)-PRE(M) and PTE(0)-PTE(M), segmentation 
unit input interface 32A and reassembly unit output interface 
34B in order to determine how and when to route Ethernet 
transmissions through switch 20. 

10 ATM Cell Format and Header Translation 

FIG. 4A illustrates the structure of an ATM cell. The fixed 
length ATM cell includes a virtual path identifier (VPI), a 
virtual channel identifier (VCI), an error check field (CRC) 

35 and a data field (PAYLOAD). Each ATM cell is a part of an 
ATM transmission made up of a sequence of such ATM cells 
and all ATM cells forming an ATM transmission have the 
same VCI and VPI identifiers. The VPI field identifies a 
particular virtual path that the cells of the transmission is to 

2Q follow through the network. The VCI field identifies a 
particular virtual channel of that path. Cells of several ATM 
transmissions assigned to the same virtual path may be 
interleaved if they are assigned to differing virtual channels 
because the VPI field in the cells distinguishes them by 

25 transmission. However, two ATM transmissions are 
assigned to the same virtual channel of the same virtual path, 
one transmission must follow the other; their cells may not 
be interleaved. An end of message (EOM) code included in 
the PAYLOAD field of the last cell of each transmission 

3Q marks the boundary between that transmission and the next. 
An ATM transmission can be routed from an ATM input 
port PRA(0)-PRA(N) or from segmentation unit output 
interface 32B either to one of ATM output ports PTA(0) 
-PTA(N) or to the reassembly unit input interface 34A. Each 

35 ATM and Ethernet input/out port PRA/PTA and PRE/PTE 
has a unique switch port ID. Segmentation and reassembly 
units 32,34 also have unique switch port IDs. Each ATM 
input port PRA(0)-PRA(N) and segmentation unit output 
interface 32B includes an internal lookup table relating each 

40 VPI value to the port ID of an ATM output port PTA(0) 
-PTA(N) or reassembly input unit 34B to which an ATM 
transmission having that VCI code is to be forwarded. The 
lookup table also indicates a relative forwarding "priority" 
for each combination of VCI and VPI. When an ATM input 

4 5 port PRA(0)-PRA(N) or the segmentation unit 34 is ready 
to forward an ATM transmission, it competes for a connec- 
tion to the destination ATM output port PTA(0)-PTA(N) or 
to reassembly input unit 34A. Arbitrator 28 grants connec- 
tions in order of forwarding priority. The RAM-based 

so lookup tables in the ATM output ports and in segmentation 
output unit 32B are maintained by an external host supply- 
ing translation data via a conventional memory bus 
(SYSTEM). 

55 Ethernet Packet Format and Address Translation 

FIG. 4B illustrates the structure of a standard Ethernet 
packet. An Ethernet packet includes a preamble, a start of 
frame delimiter (SOF), a destination address (DEST), a 
source address (SOURCE), a type field (TYPE), a data field 

60 (DATA) and an error check field (ERROR). All fields except 
the DATA field are fixed in length; the DATA field can range 
between 46 and 1500 bytes. Each Ethernet station in a 
network is identified by a unique network address. The 
destination address DEST contained in the packet indicates 

65 the network address of the station to receive the packet. The 
SOURCE address included in the packet indicates the sta- 
tion that sent the packet. 
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When an Ethernet input port PRE(0)-PRE(M) receives an data appearing on bus 57 to determine an address within cell 

incoming packet, it sends its own port ID, along with the buffer RAM 58 at which an incoming ATM cell on bus 52 

packet's SOURCE and DEST fields, to an address transla- is to be stored. When one of RAMs 60 reads that address 

tion system 44 (FIG. 3) via a parallel bus TRANS. Trans- onto address lines of bus 52 queue manager 54 sends a write 

lation system 44 maintains a lookup table relating each 5 signal to RAM 58 causing it to store the ATM cell and the 

network address of each network station to the switch port EOM bit, if any, appearing on bus 52 at that address. Queue 

ID of an Ethernet I/O port that communicates with that manager 54 then updates tables in RAMs 60 to record the 

station either directly or through another network switch. storage location of that ATM cell and to record the PORTID 

Address translation system 44 uses the incoming port ID and and PRIORITY levels associated with its transmission, 

source network address to update its lookup table. It also 10 ATM input port PRA(0) includes a switch interface circuit 

responds to the incoming destination network address by 62 providing a link between the input port to a vertical line 

returning the port ID of the switch output port PRE(0)-PTE 45 0 f crosspoint switch 20 of FIG. 3. Switch interface circuit 

(M) that can forward an Ethernet packet to the network 62 also provides a communication link between the ATM 

station identified by the destination address. Later, when it arbitration bus ARB_A and queue manager 54. Arbiter 28 

is ready to send the packet onward, the Ethernet input port is 0 f FIG. 3 periodically polls each ATM input port to deter- 

PRE(0)-PRE(M) arbitrates for the right to forward the mine if it is ready to forward a complete ATM transmission 

packet to the indicated Ethernet output port. When reassem- 0 f a particular priority to an ATM output port or to the 

bly unit 34 converts an ATM transmission into an Ethernet segmentation unit. To poll an input port, the arbiter 28 places 

packet, it also accesses address translation unit 44 via the the input port's ID on INPORT lines of the ARB_Abus and 

TRANS bus to obtain the port ID of the Ethernet output port 20 pulses a POLL signal. It also places a priority code on 

PTE(0>-PTE(M) to receive and forward the packet. PRIORITY lines of the ARB_A bus. If it is not busy 

Address translation system 44 distinguishes Ethernet forwarding an ATM transmission to an output port, switch 

ports from ATM ports. When an Ethernet input port PRE interface circuit 62 of the input port having the port ID 

(0)-PRE(M) has received a packet that must be converted to appearing on the INPORT lines places the priority value on 

an ATM transmission, and sends the packet's destination 25 lines of bus 57 and forwards the POLL signal to queue 

address to address translation system 44, the address trans- manager 54. Queue manager 54 then write accesses RAMs 

mission system returns the switch port ID of the segmenta- 60 to determine whether the port is ready to forward a 

tion input unit 32A instead of a switch port ID of a switch transmission having that priority. If so, queue manager 54 

port associated with the packet's destination address. This causes RAM 60 to read out the port ID of the output port (or 

causes the Ethernet input port PRE(0)-PRE(M) that 30 reassembly unit) that is to receive the transmission onto lines 

received the packet to request arbitrator 28 to forward the of bus 57. Switch interface circuit 62 puts the output port ID 

packet to segmentation unit 32 where it can be converted on OUTPORT lines of the ARB_BUS and asserts an 

into an ATM transmission. The external host writes the data ARB__A bus request line (REQ). Each output port monitors 

into translation table 44 via the SYSTEM bus that is needed the OUTPORT and REQ lines and in response to the REQ 

to relate a network destination address to the segmentation 35 ii ne , the output port having the ID appearing on the OUT- 

unit. PORT lines signals arbiter 28 (FIG. 3) as to whether it is idle 

ATM Input Port Architecture orb ^ ^* fra ™ n ' If the rc ^ esle d output port 

r is idle, the arbiter establishes a connection between the 

FIG. 5 illustrates ATM input port PRA(0) of FIG. 3 in requesting input port and the requested output port and 

more detailed block diagram form. Other ATM input ports 40 pulses a GRANT signal line of the ARB__A bus. Switch 

PR(1)-PR(N) are similar. ATM input port PRA(0) includes interface circuit 62 responds to the GRANT signal by 

a conventional ATM bus interface circuit 50 which receives pulsing a NEXT CELL signal to queue manager 54. Queue 

each ATM cell arriving on line RA(0), converts it to parallel manager 54 responds to each NEXT CELL signal pulse by 

form, and stores it in an internal FIFO buffer. When ready to causing RAM 58 to read out an ATM cell of the ATM 

forward a cell out of its FIFO buffer, bus interface circuit 50 45 transmission onto bus 52 to forward the cell to switch 

sends a NEXT CELL signal to a queue manager 52. When interface 62. Queue manager then sends an acknowledge 

ready to process the cell, queue manager 54 sends an signal [ACK] to switch interface circuit 62. Switch interface 

acknowledge (ACK) signal to bus interface circuit 50. circuit 52 then serializes, 4B5B encodes and forwards the 

Interface circuit 50 then places the cell on a bus 52. ATM bus C ell to the output port via vertical line 45 of the crosspoint 

interface 50 also parses the payload of the ATM cell to 50 switch. (4B5B encoding is discussed below.) When queue 

determine if it contains the end of message (EOM) code. If maD ager 54 sends the last cell of the transmission to switch 

it does, bus interface 50 sends an EOM signal to queue interface 62, an EOM bit appearing with the last ATM cell 

manager 54 and places an EOM bit on a line of bus 52. The on bus 52 tells switch interface 62 that the input port is to be 

VPIandVCI fields of the ATM cell on bus 52 provide input idle> forwarding the last cell, switch interface 62 

to a RAM-based lookup table 56 relating the VPI and VCI 55 awaits the next poll from the arbiter, 
data values to a unique connection ID (CONID), a priority 

level (PRIORITY) and the ID (PORTID) of an output port ATM Input Port Data Structures 

or reassembly unit to receive the ATM transmission. In FIG. 6 illustrates the data structures stored in RAMs 58 

response to a signal from queue manager 54, lookup table 56 and 60 of input port PRA(0) of FIG. 5. RAM 58 acts as an 

places its output PORTID, CONID, and PRIORITY data on 60 ATM cell buffer table 68, storing one ATM cell at each 

lines of a bus 57. address. Each ATM cell buffer containing an ATM cell also 

Queue manager 54 stores an ATM cell appearing on bus stores a next cell pointer "N" to a cell buffer containing a 

52 in a cell buffer RAM 58. A set of RAMs 60 implement next ATM cell of the same transmission items. Thus, the cell 

lookup tables which keep track of where in RAM 58 the buffers use their N pointers to form linked lists. Each linked 

ATM cells for each transmission are stored and the priority 65 list corresponds to a separate virtual channel and contains all 

of each transmission. Queue manager 54 read accesses the of the ATM cells assigned to that virtual channel in the order 

tables in RAMs 60 based on the CONID and PRIORITY they arrive at the port. Each cell buffer stores a bit "E" set 
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when the cell buffer contains such an EOM cell. All ATM buffer 86, and by forwarding the POLL signal to input port 

cell buffer locations not currently storing an ATM cell form queue manager 54 of FIG. 5. 

a separate "free cell linked list*'. The CON1D output at RAM Using the PRIORITY data (for example Priority«z) 

56 references a particular linked list to which the incoming appearing on the ARB_Abus to address the priority table 64 

cell is to be added. A connection table 66 implemented by 5 of FIG. 6, queue manager 54 causes the first connection table 

one of RAMs 60 addressed by the CON1D data contains an 66 entry for that priority (e.g. entry 1). If the input ports 

entry for the free cell liked list and one entry for each linked stores no completed ATM transmissions at that priority 

list in the ATM cell buffer table 68. Each connection table level * the E0M count wiU be 0. A comparator 88 in interface 

entry includes a pointer «F' to the first cell buffer of the circuit 62 compares the EOM count on bus 57 to 0 and 

corresponding linked list, a pointer "L" to the last cell buffer 10 si g Qals slate machine 80 accordingly. If the EOM count is 0, 

of the linked list, the ID (PORTID) of the output port to slale machine 80 pulses the NREQ line to indicate that is not 

which the ATM transmission is to be forwarded, a data value requesting a connection. If the EOM count is greater than 0, 

"Cell CNT" indicating the number of cells in the linked list, state machine 80 causes buffer 86 t0 P ut the 0Ut P ut P ort ID 

and data value EOM CNT indicating the number of com- rcadout of table 66 on the OUTPORT lines of the ARB_A 

plete ATM transmissions included in the linked list. is bus and P ulscs the RE< ^ line of the ARB_A bus to request 

^ * ui ♦ « i • a ■ 4 i- i a a connection to an output port. If the output port referenced 

Connection table 66 entries are also organized into linked . t . v f ai ™dti' • ji 

,. # e . -ui c a • ■ t i ah by the port ID now appearmg on the OUTPORT lines is idle, 

lists, one for each possible forwarding priority level. All ■/ , mTri . r»u ah-o a i_ *u u-, 

. . . *, . . . ' 4 , it pulses an IDLE line of the ARB_J^ bus causing the arbiter 

connections having the same priority are linked by the same . r 4 ... . <L . . . -.uin. .i_ 

»• ♦ ui v . a • '...ui ca' ^~ \ au to establish a connection through crosspomt switch 20 to the 

connection table list. A priority table 64 implemented by one f . & r 

of RAMs 60 addressed by a PRIORITY data value includes 20 reicreneea output port 

an entry for each priority level. Each priority table entry ^ f ,t Tk ^ P , «n 

includes pointers F and L to the first and last entry of the GRANT signal line of the ARB_A bus. State machine 80 

connection table 66 linked list for the corresponding priority TCS ^ b ? a °° de S enerator ™ 92 t0 transmit 

level. The manner in which queue manager 54 of FIG. 5 uses a 4B ? B f ncod 5? u start ° f transmission code sequence via 

and maintains the tables illustrated in FIG. 6 is discussed 25 a multiplexer 94 through crosspomt switch 20 to the output 

k e ^ Qw port. When the output port detects this sequence it prepares 

to receive the ATM transmission. State machine 80 then 

ATM Output Port Architecture » nds a NEXT_CELL signal to queue manager 54 causing 

queue manager 54 to send the first cell of the stored ATM 

FIG. 7 illustrates ATM output port PTA(0) of FIG. 3 in 30 transmission outward on bus 52 to a circuit 90. Queue 

more detailed block diagram form. Output ports PTA(1) manager 54 then pulses an acknowledge signal (ACK) input 

-PTA(N) are similar. Output port PTA(0) includes a switch t 0 state machine 80. State machine 80 than signals circuit 90 

interface circuit 50 for receiving serial transmissions of to serialize the cell and convert it to the well-known "4B5B" 

4B5B encoded, serialized ATM cells from crosspoint switch encoded form. In this form of encoding, each 4-bit nibble of 

20 of FIG. 3. Switch interface circuit 70 4B5B decodes and 35 the cell is converted into a 5-bit code. While many 5-bit 

deserializes each cell and stores it in a first-in, first-out 4B5B codes correspond to encoded 4-bit values, some 5 -bit 

(FIFO) buffer 72 by pulsing a shift-in(SI) input of buffer 72. 4B5B code values are reserved for use as various synchro- 

When not empty, FIFO buffer 72 sends a NOT_EMPTY nizing codes, start and end of message codes and the like, 

signal to an ATM bus interface circuit 74. ATM bus interface State machine 80 forwards each encoded ATM cell to output 

circuit 74 responds by strobing a shift out (SO) FIFO buffer ^ port switch interface circuit 70 via multiplexer 94 and switch 

52 control input causing the FIFO buffer to shift out a 20. Thereafter, queue manager 54 sends each cell of the 

longest stored ATM cell. ATM bus interface circuit 74 then ATM transmission to circuit 90 in response to each NEXT 

serializes the ATM cell and transmits it outward on ATM bus CELL signal pulse and state machine 80 responds to each 

TA(0) using ATM protocol. As discussed below, arbiter 28 ACK signal pulse by forwarding the encoded serialized cell 

of FIG. 3 polls switch interface circuit 70 via the ARBIT_A 45 from circuit 90 to the interface circuit 70 of the destination 

bus to determine when output port PTA(0) is idle, output port. When the last CELL of the transmission appears 

on BUS 52 the EOM bit stored with that cell appears on a 

ATM Arbitration ^ of bus s2 providing input to state machine 80. After 

FIG. 8 illustrates ATM output port switch interface circuit forwarding that cell to the output port, state machine 80 

62 of FIG. 5 and ATM input port switch interface circuit 70 50 signals code generator 92 and multiplexer 94 to send an end 

of FIG. 7 in more detailed block diagram form. To poll an of transmission code to the output port. The EOM bit also 

ATM input port, arbiter 28 of FIG. 3 places a priority level tells q ueue manager 54 to decrement the EOM count in the 

value and input port ID on the PRIORITY and INPORT lines connection table entry to update the free cell list and to 

of the ARB__A bus and pulses the POLL line of the ARB_A remove the freed ATM cell buffer table entries from the cell 

bus. A comparator 78 compares the input port's PORT ID 55 buffer linked list - 

with the input port ID on the INPUT lines and signals a state Output port switch interface circuit 70 of FIG. 7, shown 

machine 80 when they match. When the port is busy in more detail in FIG. 8, includes a state machine 96 and a 

forwarding an ATM transmission to an output port, state comparator 97. Comparator 97 compares the output port's 

machine 80 pulses an NREQ line of the ARB_^A bus to ID to the port ID appearing on the OUTPORT lines of the 

indicate that it does not want to request a connection to an 60 ARB_Abus and signals state machine 96 when they match, 

output port. The arbiter then polls another input port. When When they match, and when the output port is not busy 

the polled input port 62 is not busy forwarding an ATM receiving a transmission from an input port, state machine 

transmission to an output port, a state machine 80 responds 96 asserts the IDLE signal line of the ARB__A bus. 

to the signal from comparator 78 and the POLL signal pulse Output port switch interface circuit 70 also includes a 

by forwarding the PRIORITY line data via a buffer 82 to 65 4B5B decoder and deserializer circuit 98 which receives, 

lines of bus 57 via a buffer 84, by connecting other lines of decodes and deserializes transmissions from the input port 

bus 57 to the OUTPORT lines of the ARB_A bus via a via switch 20 to produce an output sequence of ATM cells. 
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On producing each cell of the sequence, circuit 98 pulses a 
NEXT_CELL signal which shifts the cell into FIFO buffer 
72 of FIG, 7. Circuit 98 also sends SOM and EOM signal 
signals to state machine 96 to indicate when it detects a start 
and end of message codes in its input data stream. These 
signals tell state machine 96 whether the output port is busy 
or idle. 

Ethernet Input Port Architecture 

FIG. 9 illustrates Ethernet input port PRE(0) of FIG. 3 in 
more detailed block diagram form. Ethernet input ports 
PRE(1)-PRE(M) are similar. Ethernet input port PRE(0) 
includes an Ethernet bus interface circuit 100 for receiving 
Ethernet transmissions on input Ethernet line RE(0). Bus 
interface 100 shifts each 4-bit nibble of a transmission into 
a FIFO buffer 102. Also, as the source and destination 
address fields of an incoming Ethernet packet arrive, inter- 
face circuit 100 shifts them into a serial-in, parallel-out shift 
register 104. As the last nibble of the incoming Ethernet 
packet is shifted into FIFO buffer 104, bus interface 100 also 
shifts an EOM bit into FIFO buffer 102. At the same time, 
it sends a TRANSLATE signal to a translation bus interface 
circuit 106. Interface circuit 106 responds by reading the 
source and destination fields out of shift register 104 and 
sending them to address translation lookup table 44 of FIG. 
3 via the TRANS bus. When translation lookup table 44 
returns the switch port ID of the output port that is to receive 
the packet, interface circuit 106 shifts it into a FIFO buffer 
108. When FIFO buffer 108 is not empty it asserts a request 
signal (REQ) to a switch interface circuit 110 to indicate that 
the port is storing a complete Ethernet packet. 

Arbiter 28 of FIG. 3 polls the input port by placing the 
input port's ID on INPORT lines of the ARB_E bus and 
pulsing a POLL line of the ARB_E bus. When the polled 
input port is busy forwarding data, or when the REQ signal 
output of FIFO buffer 108 indicates that FIFO buffer 102 is 
empty, switch interface circuit 110 pulses an NREQ line of 
the ARB__E bus to tell the arbiter 28 that is not requesting 
a new connection. Otherwise if the input port is idle and the 
REQ signal from FIFO buffer 108 is asserted, switch inter- 
face circuit 110 responds to a poll by placing the PORT ID 
output of FIFO buffer 108 on the OUTPUT lines of the 
ARB_E bus and pulsing the REQ line of the ARB_E bus. 
If the output port identified by the ID on the OUTPORT lines 
is idle, arbiter 28 of FIG, 3 connects the input port to the idle 
output port and pulses a GRANT line of the ARB__E bus. 
Switch interface circuit 110 then begins shifting nibbles out 
of FIFO buffer 102, 4B5B encoding and serializing them, 
and forwarding them to the output port via switch 20 of FIG. 
3. The EOM bit shifted out of FIFO buffer 102 with the last 
nibble of the Ethernet packet signals switch interface 110 to 
stop forwarding data and to shift the port ID out of FIFO 
buffer 108 and wait for another poll. 

Ethernet Output Port Architecture 

FIG. 10 illustrates Ethernet output port PTE(0) of FIG. 3 
in more detailed block diagram form. Ethernet output ports 
PTE(1)-PTE(M) are similar. Output port PTE(0) includes a 
switch interface circuit 112 for receiving serialized, 4B5B 
encoded packets from an Ethernet input port via crosspoint 
switch 20 of FIG. 3. Circuit 112 decodes and deserializes the 
packet data to produce an output byte sequence that it shifts 
into a FIFO buffer 114. An Ethernet interface circuit 116 
shifts out and serializes the byte sequence and forwards it 
outward as an Ethernet transmission on Ethernet bus line 
TE(0). 



12 

Ethernet Arbitration 



FIG. 11 illustrates input port switch interface 110 of FIG. 
9 and output port switch interface 112 of FIG. 10 in more 
detailed block diagram form. Arbitrator 28 of FIG. 3 polls an 
Ethernet input port by placing its ID on the INPORT lines of 
the ARB__E bus and pulsing the POLL line of that bus. Input 
port switch interface 110 includes a comparator 118 for 
signaling a state machine 120 when the input port ID on an 
INPORT line of the ARB_E bus matches the input port's 
ID. In such case state machine 120 responds to the POLL by 
pulsing the NREQ signal line of the ARB_E bus if the port 
is busy forwarding data to an output port or if the REQ signal 
from buffer 108 of FIG. 9 is not asserted. Otherwise state 
machine 120 turns on a tristate buffer 122 to place the output 
port ID output of FIFO buffer 108 on the OUTPORT lines 
of the ARB_E bus and then asserts the REQ signal. If the 
output port having that ID is idle, arbiter 28 of FIG. 3 
establishes a connection between the requesting input port 
and the idle output port and then pulses the GRANT signal 
line of the ARB_E bus. State machine 120 signals a code 
generator circuit 124 to send a 4B5B start of message code 
via a multiplexer 126 signals crosspoint switch 20 to output 
port interface circuit 112. State machine 120 then begins 
shifting 4-bit nibbles of the Ethernet packet out of FIFO 
buffer 108 of FIG. 9 and clocking them through a 4B5B 
encoder and serializer circuit 128. The encoded and serial- 
ized nibbles then pass through multiplexer 126 and cross- 
point switch 20 to output port interface circuit 112. An EOM 
bit emerging from FIFO buffer 108 with the last nibble of the 
Ethernet packet signals state machine 120. State machine 
120 then signals code generator 124 to send an end of 
message code to output port interface circuit 112. Thereafter 
state machine 120 considers the input port idle and awaits 
another poll. 

Output port interface circuit 112 includes a 4B5B decoder 
and deserializer circuit 130 for converting the 4B5B 
encoded serial data stream from input port 110 into a byte 
sequence and loading it into FIFO buffer 114 of FIG. 10. 
Circuit 130 also transmits SOM and EOM signals to a state 
machine 132 when it detects the start of message and end of 
message codes in the data stream. When the output port is 
not busy forwarding data from an input port, state machine 
132 asserts the IDLE signal line of the ARB_E bus when a 
comparator 134 signals that the ID on the OUTPORT line of 
the ARB_E bus matches the output port's ID. 

Arbiter 

FIG. 12 illustrates arbiter 28 of FIG. 3 in more detailed 
50 block diagram form. Arbiter 28 includes a state machine 140 
which alternates between ATM and Ethernet arbitration 
modes. In the ATM mode, state machine 140 clocks a 
counter 142 to increment the port ID on the INPORT lines 
of the ARB_A bus and clocks a counter 144 that produces 
55 the priority level conveyed on the PRIORITY lines of the 
ARB_A bus. Each time counter 142 is clocked it increments 
the input port ID until it overflows. At that point it resets its 
output count to 0 and signals state machine 140. When state 
machine 140 resets counter 144, counter 144 outputs the 
60 highest priority code and thereafter decrements that priority 
code each time that it is clocked thereafter. Upon counting 
down to 0, counter 144 resets its output to the highest 
priority code. 

State machine 140 initially resets counter 142. It then 
65 pulses the ARB_^A bus POLL signal and waits until it 
detects an REQ or NREQ signal from the polled input port. 
If an NREQ signal pulse is returned, state machine 140 
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clocks counter 142 to select a next input port to poll and, if Input interface 34A sequentially shifts the payload portion 
counter 142 overflows, clocks counter 144 to select a next of incoming ATM cells into a FIFO buffer 152 and also shifts 
lower priority. State machine 140 then polls the next input in, along with each cell payload, an EOM bit indicating 
port. If the REQ signal is returned in response to an input whether the incoming ATM cell is the last cell of ATM 
port poll, but an output port does not assert the IDLE line of 5 transmission. The header portion of the ATM cells is sup- 
ine ARB^A bus, state machine 140 polls a next input port. plied to a header translation table 156 maintained by the host 
If state machine 140 detects assertion of an REQ signal from via the SYSTEM bus. Header translation table 156 converts 
a polled input port and detects an IDLE signal from a the information in the header to preamble and source and 
requested output port, it sets an output signal A/E to switch destination address fields (SOURCE,DEST) of an Ethernet 
a multiplexer 146 so that it passes the input and output port 10 packet and also produces output PORTID data referencing 
ID'S on the ARB_^A bus INPORT and OUTPUT lines to a the ID of a switch output port to receive the packet. As the 
decoder 148. Decoder 148 converts the input and output port cells are loaded into FIFO buffer 152 a check sum circuit 
ID's to address and data inputs to RAM 42 of FIG. 3 that 158 computes the ERROR field of the packet. When input 
will establish a connection between the input and output interface circuit 34A pulses its output EOM signal, state 
ports through crosspoint switch 20 of FIG. 3. State machine is machine 154 sends an output REQ signal to output interface 
140 next pulses a write signal input to RAM 42 of FIG. 3 to circuit 34B. Thereafter output interface circuit 34B responds 
write the data into RAM 42, thereby establishing the con- to a poll from arbiter 28 of FIG. 3 by sending the PORTID 
nection between the ports. State machine 140 then pulses the from header translation table 156 to the arbiter via the 
GRANT signal time of the ARB- A bus to tell the input port ARB_E bus and requesting a connection. When the arbiter 
that it may begin transmitting data. State machine 140 20 grants the request, output interface circuit 34B begins send- 
thereafter resets counter 144 to produces the highest priority ing NEXT NIBBLE signals to state machine 154 indicating 
output code and begins the polling process once again. when it requires a next nibble of the Ethernet packet to be 

Thus for each priority level arbiter 28 polls each ATM forwarded. State machine 154 controls a multiplexer 160 

input port in turn to determine if that input port is ready to which selects from among nibbles of the PREAMBLE, 

forward an ATM transmission to an idle input port. Arbiter 25 SOURCE and DEST field outputs of header translation table 

works its way down the priority levels until it finds such an 1 56 » tne ERROR field output of check sum circuit 158, and 

input port, establishes the connection, and then starts over the payload output of FIFO buffer 152. State machine 154 

again polling the ports at the highest priority level. also signals FIFO buffer 152 to shift out payload data when 

In the Ethernet mode, state machine 140 clocks a counter needed. In response to a first set of NEXT NIBBLE signals, 

150 to increment the port ID on the INPORT lines of the 30 state machine 154 routes successive nibbles of the 

ARB_E bus. State machine 140 initially pulses the ARB_E PREAMBLE, SOURCE and DEST fields to output interface 

bus POLL signal and waits until it detects a REQ or NREQ circuit 34B - 11 lhen ^gins routing successive nibbles of the 

signal from the polled input port. If an NREQ signal pulse P^load output of FIFO buffer 152 while successively 

is returned, state machine 140 clocks counter 150 to select shifting out cell payloads as necessary. After detecting an 

a next input port to poll then polls the next input port. If the 35 E0M bit emerging from FIFO buffer 152, state machine 154 

polled input port returns the REQ signal, but a requested routes successive nibbles of the ERROR field to output 

output port does not assert the IDLE line of the ARB_Abus, interface circuit 34B. State machine 154 then sends an EOM 

state machine 140 polls a next input port. If a polled input signal to output interface circuit 34B to indicate the end of 

port asserts the REQ signal and a requested output port lne packet. 

asserts IDLE signal, state machine 140 sets output signal 40 Segmentation Unit 

A/E so that switch multiplexer 146 passes the input and m „ ^ . . _ _ _ . 

output port ID'S on the ARB_E bus INPORT and OUT- J ™- " Ulu ^ rates segmentation unit 32 of FIG. 2 in more 

PORT lines to decoder 148 and then pulses the WRITE detailed block diagram form. Segmentation unit 32 includes 

signal input to RAM 42 of FIG. 3 to write data into RAM an in P m ^erface 32A, ^substantially similar to output port 

42. Tlus establishes the connection between the requesting « switch interface 112 of FIG. 11, for receiving serialized, 

input port and the requested output port. State machine 140 4B5B encoded Ethernet packet from an Ethernet input port 

then pulses the GRANT signal to tell the input port that it and inverting it to an output sequence of bytes. Input 

may begin forwarding data. State machine 140 thereafter ? 2A also handles E ^ ernet arbitration via the 

polls a next input port. ARB "?_ buS r ^ a , mamier smnlar t0 ™*?[ po L rt mte 1 rface 

50 circuit 112 of FIG. 11. Segmentation unit 32 further includes 

Reassembly Unit an output interface circuit 32B similar to ATM input port 

FIG. 13 illustrates reassembly unit 34 of FIG. 2 in more switch interface circuit 62 of FIG. 8 for receiving a sequence 

detailed block diagram form. Reassembly unit 34 includes of 4-bit nibbles forming ATM cells and forwarding them to 

an input interface 34A substantially similar to output port an ATM output port in serialized, 4B5B encoded form. 

switch interface 70 of FIG. 8 for receiving a serialized, 55 Output interface circuit 32B uses the ARB_A bus to arbi- 

4B5B encoded ATM cell from an ATM input port and trate for access to ATM output ports in the same manner as 

converting it to an output sequence of bytes. Input interface interface circuit 62 of FIG. 8. 

34A also handles ATM arbitration via the ARB_A bus in a Input interface circuit 34A signals a state machine 162 

manner similar to output port interface circuit 70. Reassem- when it outputs a next byte of an incoming Ethernet packet, 

bly unit 34 also includes an output interface circuit 34B, 60 The bytes forming the PREAMBLE, SOF, TYPE and 

similar to Ethernet input port switch interface circuit 110 of ERROR fields of the packet are discarded. State machine 

FIG. 11, for receiving a sequence of 4-bit nibbles forming an 162 signals a header translation table 164 to acquire the 

Ethernet packet and forwarding them to an Ethernet output bytes forming SOURCE and DEST fields, and shifts bytes 

port in serialized, 4B5B encoded form. Output interface forming the PAYLOAD field of each cell into a FIFO buffer 

circuit 34B uses the ARB_E bus to arbitrate for access to 65 1 66. After receiving the SOURCE and DEST fields, header 

Ethernet output ports in the same manner as interface circuit translation table 164, using information provided by the host 

110 of FIG. 11. via the SYSTEM bus, produces the PORTID of the desti- 
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nation ATM output port, the VPI and VCI fields for the ATM 
cells, and a PRIORITY code for the ATM transmission. 
Upon receiving an EOM bit output of input interface circuit 
32A, state machine 162 shifts the EOM bit into FIFO buffer 
166. Thereafter, when arbiter 28 of FIG. 3 polls output 
interface circuit 32B, interface circuit 32 forwards the pri- 
ority code conveyed on the ARB_A line to a comparator 
168 and asserts a POLL signal. Comparator 168 compares 
the code to the priority output of header translation table 164 
and signals state machine 162 when they match. If state 
machine 162 has received an EOM bit from input interface 
32A indicating that FIFO buffer 166 stores a complete 
Ethernet packet, state machine 162 sends an REQ signal to 
output interface 32B. Interface 32B then requests the arbiter 
to establish a connection to the ATM output port. When the 
connection is granted, output interface 32 sends a series of 
NEXT signal pulses to state machine 162. In response to 
each NEXT pulse, state machine 162 switches a multiplexer 
170 to select a next nibble of a next ATM cell to be 
forwarded from among the output of FIFO buffer 166, the 
VPI and VCI outputs of header translation table 164 and the 
output of an error code generator circuit 172 which produces 
the cell's CRC field. State machine 162 shifts data out of 
FIFO buffer 166 to obtain data for the PAYLOAD fields for 
the ATM cells. After detecting an EOM bit emerging from 
FIFO buffer 166, state machine 162 sends the CRC output 
of error code generator 172 to output interface 32B and then 
forwards an EOM signal to interface 32B to signal the end 
of the ATM transmission. 

Thus has been described a network switch that can route 
both ATM and Ethernet packets and which can translate 
between Ethernet and ATM protocol packets. Since the 
network switch translates between ATM and Ethernet pro- 
tocols only when necessary, system throughput is enhanced. 
Also, since ATM transmissions are buffered at the input 
ports and fed to the reassembly unit on a per virtual channel 
basis without interleaving ATM cells, the reassembly unit 
need not sort out ATM transmissions and need not include 
extensive buffer memory for storing sorted ATM transmis- 
sions prior to reassembly into Ethernet packets. 

While the forgoing specification has described preferred 
embodiments) of the present invention, one skilled in the art 
may make many modifications to the preferred embodiment 
without departing from the invention in its broader aspects. 
For example, while network switch is described as handling 
Ethernet and ATM protocol transmissions, those skilled in 
the art will understand that the basic architecture of the 
switch can be adapted to handle routing and conversion of 
other network protocols. The appended claims therefore are 
intended to cover all such modifications as fall within the 
true scope and spirit of the invention. 

What is claimed is: 

1. A network routing switch for receiving first protocol 
transmissions from external network devices, for converting 
first protocol transmissions to second protocol 
transmissions, and for forwarding received and converted 
first and second protocol transmissions to external network 
devices, the network switch comprising: 

a plurality of first ports, each for receiving first protocol 
transmissions from an external network device and for 
forwarding first protocol transmissions to said external 
network device; 
a plurality of second ports, each for forwarding second 
protocol transmissions to an external network device; 
first conversion means for converting a first protocol 
transmission into a second protocol data transmission; 
and 
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routing means for selectively interconnecting said first 
and second ports and said first conversion means for 
conveying transmissions therebetween; 

wherein each first port receiving a first protocol transmis- 
sion from an external network device stores the first 
protocol transmission and then sends the stored first 
protocol transmission through said routing means to 
another of said first ports when the first protocol 
transmission is to be forwarded to an external network 
device as a first protocol transmission, and sends a 
stored first protocol transmission through said routing 
means to said first conversion means when the stored 
first protocol transmission is to be converted to a 
second protocol transmission before being forwarded 
to an external network device, 

wherein the first conversion unit converts each first pro- 
tocol transmission forwarded thereto into a second 
protocol transmission and sends the second protocol 
transmission through said routing means to one of said 
second ports to be forwarded thereby to an external 
network device, 

wherein each said first protocol transmission received by 
a first port comprises a plurality of sequentially trans- 
mitted cells, each cell including channel data referenc- 
ing a virtual channel, 

wherein a last cell of each first protocol transmission 
includes an end of message code identifying said last 
cell as such, 

wherein at least one of said first ports receives cells of 
separate first protocol transmissions from external net- 
work sources in interleaved fashion, 

wherein each of said first ports stores cells of received first 
protocol data transmissions according to the virtual 
channel referenced by their virtual channel data, and 

wherein each of said first ports sends a stored first 
protocol transmission through said routing means only 
after receiving and storing said last cell of said first 
protocol transmission. 

2. A network routing switch for receiving first protocol 
transmissions from external network devices, for converting 
first protocol transmissions to second protocol 
transmissions, and for forwarding received and converted 
first and second protocol transmissions to external network 
devices, the network switch comprising: 

a plurality of first ports, each for receiving first protocol 
transmissions from an external network device and for 
forwarding first protocol transmissions to said external 
network device; 

a plurality of second ports, each for forwarding second 
protocol transmissions to an external network device; 

first conversion means for converting a first protocol 
transmission into a second protocol data transmission; 
and 

routing means for selectively interconnecting said first 
and second ports and said first conversion means for 
conveying transmissions therebetween; 

wherein each first port receiving a first protocol transmis- 
sion from an external network device stores the first 
protocol transmission and then sends the stored first 
protocol transmission through said routing means to 
another of said first ports when the first protocol 
transmission is to be forwarded to an external network 
device as a first protocol transmission, and sends a 
stored first protocol transmission through said routing 
means to said first conversion means when the stored 
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first protocol transmission is to be converted to a 
second protocol transmission before being forwarded 
to an external network device, 

wherein the first conversion unit converts each first pro- 
tocol transmission forwarded thereto into a second 
protocol transmission and sends the second protocol 
transmission through said routing means to one of said 
second ports to be forwarded thereby to an external 
network device, 

wherein each said first protocol transmission received by 
a first port comprises a plurality of sequentially trans- 
mitted cells, each cell including channel data referenc- 
ing a virtual channel, and 

wherein each said first port includes means for 
determining, from the channel data included in the cells 
of each first protocol transmission received from an 
external network device, whether to send said first 
protocol transmission through said routing means to 
another of said first ports or to said first conversion 
means. 

3. A network routing switch for receiving first protocol 
transmissions from external network devices, for converting 
first protocol transmissions to second protocol 
transmissions, and for forwarding received and converted 
first and second protocol transmissions to external network 
devices, the network switch comprising: 

a plurality of first ports, each for receiving first protocol 
transmissions from an external network device and for 
forwarding first protocol transmissions to said external 
network device; 

a plurality of second ports, each for forwarding second 
protocol transmissions to an external network device; 

first conversion means for converting a first protocol 
transmission into a second protocol data transmission; 
and 

routing means for selectively interconnecting said first 
and second ports and said first conversion means for 
conveying transmissions therebetween; 

wherein each first port receiving a first protocol transmis- 
sion from an external network device stores the first 
protocol transmission and then sends the stored first 
protocol transmission through said routing means to 
another of said first ports when the first protocol 
transmission is to be forwarded to an external network 
device as a first protocol transmission, and sends a 
stored first protocol transmission through said routing 
means to said first conversion means when the stored 
first protocol transmission is to be converted to a 
second protocol transmission before being forwarded 
to an external network device, 

wherein the first conversion unit converts each first pro- 
tocol transmission forwarded thereto into a second 
protocol transmission and sends the second protocol 
transmission through said routing means to one of said 
second ports to be forwarded thereby to an external 
network device, 

wherein each first protocol transmission includes a prior- 
ity data value assigning a priority to said first protocol 
transmission, and 

wherein said routing means routes first protocol transmis- 
sions to any of said first ports and to said first conver- 
sion means with a priority determined by the priority 
data values included in the first protocol transmissions. 

4. The network routing switch in accordance with claim 1 
wherein each first protocol transmission includes a priority 



10 



15 



20 



25 



30 



35 



40 



45 



50 



60 



data value assigning a priority to said first protocol 
transmission, and 

wherein each of said first ports also stores cells of 

received first protocol transmissions according to the 

priority assigned to the transmission. 

5. A network routing switch for receiving first protocol 
transmissions from external network devices, for converting 
first protocol transmissions to second protocol 
transmissions, and for forwarding received and converted 
first and second protocol transmissions to external network 
devices, the network switch comprising: 

a plurality of first ports, each for receiving first protocol 
transmissions from an external network device and for 
forwarding first protocol transmissions to said external 
network device; 

a plurality of second ports, each for forwarding second 
protocol transmissions to an external network device; 

first conversion means for converting a first protocol 
transmission into a second protocol data transmission; 
and 

routing means for selectively interconnecting said first 
and second ports and said first conversion means for 
conveying transmissions therebetween, 

wherein each first port receiving a first protocol transmis- 
sion from an external network device stores the first 
protocol transmission and then sends the stored first 
protocol transmission through said routing means to 
another of said first ports when the first protocol 
transmission is to be forwarded to an external network 
device as a first protocol transmission, and sends a 
stored first protocol transmission through said routing 
means to said first conversion means when the stored 
first protocol transmission is to be converted to a 
second protocol transmission before being forwarded 
to an external network device, 

wherein the first conversion unit converts each first pro- 
tocol transmission forwarded thereto into a second 
protocol transmission and sends the second protocol 
transmission through said routing means to one of said 
second ports to be forwarded thereby to an external 
network device, 

wherein each said first protocol transmission received by 
a first port comprises a plurality of sequentially trans- 
mitted cells, each cell including channel data referenc- 
ing a virtual channel, and 

wherein each second protocol transmission consists of a 
single packet, said packet including a data payload and 
a network destination address of a network device to 
receive the packet. 

6. The network routing switch in accordance with claim 5 
wherein each second port determines from the network 
destination address included in each packet received from an 
external network device whether to send said packet through 
said routing means to another of said second ports or to said 
second conversion means. 

7. A network routing switch for receiving first protocol 
transmissions from external network devices, for converting 
first protocol transmissions to second protocol 
transmissions, and for forwarding received and converted 
first and second protocol transmissions to external network 
devices, the network switch comprising: 

a plurality of first ports, each for receiving first protocol 
transmissions from an external network device and for 
forwarding first protocol transmissions to said external 
network device; 
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a plurality of second ports, each for forwarding second 
protocol transmissions to an external network device; 

first conversion means for converting a first protocol 
transmission into a second protocol data transmission; 
and 

routing means for selectively interconnecting said first 
and second ports and said first conversion means for 
conveying transmissions therebetween; 

wherein each first port receiving a first protocol transmis- 
sion from an external network device stores the first 
protocol transmission and then sends the stored first 
protocol transmission through said routing means to 
another of said first ports when the first protocol 
transmission is to be forwarded to an external network 
device as a first protocol transmission, and sends a 
stored first protocol transmission through said routing 
means to said first conversion means when the stored 
first protocol transmission is to be converted to a 
second protocol transmission before being forwarded 
to an external network device, 

wherein the first conversion unit converts each first pro- 
tocol transmission forwarded thereto into a second 
protocol transmission and sends the second protocol 
transmission through said routing means to one of said 
second ports to be forwarded thereby to an external 
network device, 

wherein each said first protocol transmission received by 
a first port comprises a plurality of sequentially trans- 
mitted cells, each cell including channel data referenc- 
ing a virtual channel, 

wherein a last cell of each first protocol transmission 
includes an end of message code identifying said last 
cell as such, 

wherein at least one of said first ports receives cells of 
separate first protocol transmission from external net- 
work sources in interleaved fashion, 

wherein each of said first ports stores cells of received first 
protocol transmissions according to the virtual channel 
referenced by their virtual channel data, 

wherein each of said first ports sends a stored first 
protocol transmission through said routing means only 
after receiving and storing said last cell of said first 
protocol transmission, and 

wherein each said first port includes means for 
determining, from the channel data included in the cells 45 
of each first protocol transmission received from an 
external network device, whether to send said first 
protocol transmission through said routing means to 
another of said first ports or to said first conversion 
means. 

8. The network routing switch in accordance with claim 7 
wherein each first protocol transmission includes a priority 
data value assigning a priority to said first protocol data 
transmission, and 

wherein said routing means routes first protocol transmis- 
sions to any of said first ports and to said first conver- 
sion means with a priority determined by the priority 
data values included in the first protocol transmissions. 

9. The network routing switch in accordance with claim 8 
wherein each of said first ports also stores cells of received 60 
first protocol transmissions according to the priority 
assigned thereto. 

10. The network routing switch in accordance with claim 
9 wherein each said second protocol transmissions consists 

of a single packet, said packet including a data payload and 65 
a network destination address of a network device to receive 
the packet. 
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11. A network routing switch for receiving differing first 
and second protocol transmissions from external network 
devices, for converting first protocol transmissions to second 
protocol transmissions, for converting second protocol 
transmissions into first protocol transmissions, and for for- 
warding received and converted first and second protocol 
transmissions to external network devices, the network 
switch comprising: 

a set of first ports, each for receiving first protocol 
transmissions from an external network device and for 
forwarding first protocol transmissions to said external 
network device; 

a set of second ports, each for receiving second protocol 
transmissions from an external device and for forward- 
ing second protocol transmissions to said external 
network device; 

first conversion means for converting a first protocol 
transmission into a second protocol data transmission; 

second conversion means for converting a second proto- 
col transmission into a first protocol transmission; and 

routing means for selectively interconnecting said first 
and second ports and said first and second conversion 
means for conveying transmissions therebetween, 

wherein each first port receiving a first protocol transmis- 
sion from an external network device stores the first 
protocol transmission and then sends the stored first 
protocol transmission through said routing means to 
another of said first ports when the first protocol 
transmission is to be forwarded to an external network 
device as a first protocol transmission, and sends a 
stored first protocol transmission through said routing 
means to said first conversion means when the stored 
first protocol transmission is to be converted to a 
second protocol transmission before being forwarded 
to an external network device, 

wherein each second port receiving a second protocol 
transmission from an external network device stores the 
second protocol transmission until it is complete and 
then sends the stored second protocol transmission 
through said routing means to another of said second 
ports when the second protocol transmission is to be 
forward to an external network device as a second 
protocol transmission, and sends the stored second 
protocol transmission through said routing means to 
said second conversion means when the stored second 
protocol transmission is to be converted to a first 
protocol transmission before being forwarded to an 
external network device, 

wherein the first conversion unit converts each first pro- 
tocol transmission forwarded thereto into a second 
protocol transmission and sends the second protocol 
transmission through said routing means to one of said 
second ports to be forwarded thereby to an external 
network device, 

wherein the second conversion unit converts each second 
protocol transmission forwarded thereto into a first 
protocol transmission and sends the first protocol trans- 
mission through said routing means to one of said first 
ports to be forwarded thereby to an external network 
device, 

wherein each said first protocol transmission received by 
a first port comprises a plurality of sequentially trans- 
mitted cells, each cell including channel data referenc- 
ing a virtual channel, 

wherein a last cell of each first protocol transmission 
includes an end of message code identifying said last 
cell as such, 
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wherein at least one of said first ports receives cells of 
separate first protocol transmission from external net- 
work sources in interleaved fashion, 
wherein each of said first ports stores cells of received first 
protocol transmissions according to the virtual channel 
referenced by their virtual channel data, and 
wherein each of said first ports sends a stored first 
protocol data transmission through said routing means 
only after receiving and storing said last cell of said first 
protocol data transmission. 
12. A network routing switch for receiving differing first 
and second protocol transmissions from external network 
devices, for converting first protocol transmissions to second 
protocol transmissions, for converting second protocol 
transmissions into first protocol transmissions, and for for- 
warding received and converted first and second protocol 
transmissions to external network devices, the network 
switch comprising: 

a set of first ports, each for receiving first protocol 
transmissions from an external network device and for 
forwarding first protocol transmissions to said external 
network device; 
a set of second ports, each for receiving second protocol 
transmissions from an external device and for forward- 
ing second protocol transmissions to said external 
network device; 
first conversion means for converting a first protocol 
transmission into a second protocol data transmission; 
and 

second conversion means for converting a second proto- 
col transmission into a first protocol transmission; 

routing means for selectively interconnecting said first 
and second ports and said first and second conversion 
means for conveying transmissions therebetween; 

wherein each first port receiving a first protocol transmis- 
sion from an external network device stores the first 
protocol transmission and then sends the stored first 
protocol transmission through said routing means to 
another of said first ports when the first protocol 
transmission is to be forwarded to an external network 
device as a first protocol transmission, and sends a 
stored first protocol transmission through said routing 
means to said first conversion means when the stored 
first protocol transmission is to be converted to a 
second protocol transmission before being forwarded 
to an external network device, 

wherein each second port receiving a second protocol 
transmission from an external network device stores the 
second protocol transmission until it is complete and 
then sends the stored second protocol transmission 
through said routing means to another of said second 
ports when the second protocol transmission is to be 
forward to an external network device as a second 
protocol transmission, and sends the stored second 
protocol transmission through said routing means to 
said second conversion means when the stored second 
protocol transmission is to be converted to a first 
protocol transmission before being forwarded to an 
external network device, 

wherein the first conversion unit converts each first pro- 
tocol transmission forwarded thereto into a second 
protocol transmission and sends the second protocol 
transmission through said routing means to one of said 
second ports to be forwarded thereby to an external 
network device, 
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wherein the second conversion unit converts each second 
protocol transmission forwarded thereto into a first 
protocol transmission and sends the first protocol trans- 
mission through said routing means to one of said first 
ports to be forwarded thereby to an external network 
device, 

wherein each said first protocol transmission received by 
a first port comprises a plurality of sequentially trans- 
mitted cells, each cell including channel data referenc- 
ing a virtual channel, and 

wherein each said first port includes means for 
determining, from the channel data included in the cells 
of each first protocol transmission received from an 
external network device, whether to send said first 
protocol transmission through said routing means to 
another of said first ports or to said first conversion 
means. 

13. A network routing switch for receiving differing first 
and second protocol transmissions from external network 
devices, for converting first protocol transmissions to second 
protocol transmissions, for converting second protocol 
transmissions into first protocol transmissions, and for for- 
warding received and converted first and second protocol 
transmissions to external network devices, the network 
switch comprising: 

a set of first ports, each for receiving first protocol 
transmissions from an external network device and for 
forwarding first protocol transmissions to said external 
network device; 

a set of second ports, each for receiving second protocol 
transmissions from an external device and for forward- 
ing second protocol transmissions to said external 
network device; 

first conversion means for converting a first protocol 
transmission into a second protocol data transmission; 

second conversion means for converting a second proto- 
col transmission into a first protocol transmission; and 

routing means for selectively interconnecting said first 
and second ports and said first and second conversion 
means for conveying transmissions therebetween; 

wherein each first port receiving a first protocol transmis- 
sion from an external network device stores the first 
protocol transmission and then sends the stored first 
protocol transmission through said routing means to 
another of said first ports when the first protocol 
transmission is to be forwarded to an external network 
device as a first protocol transmission, and sends a 
stored first protocol transmission through said routing 
means to said first conversion means when the stored 
first protocol transmission is to be converted to a 
second protocol transmission before being forwarded 
to an external network device, 

wherein each second port receiving a second protocol 
transmission from an external network device stores the 
second protocol transmission until it is complete and 
then sends the stored second protocol transmission 
through said routing means to another of said second 
ports when the second protocol transmission is to be 
forward to an external network device as a second 
protocol transmission, and sends the stored second 
protocol transmission through said routing means to 
said second conversion means when the stored second 
protocol transmission is to be converted to a first 
protocol transmission before being forwarded to an 
external network device, 

wherein the first conversion unit converts each first pro- 
tocol transmission forwarded thereto into a second 
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protocol transmission and sends the second protocol 
transmission through said routing means to one of said 
second ports to be forwarded thereby to an external 
network device, 
wherein the second conversion unit converts each second 
protocol transmission forwarded thereto into a first 
protocol transmission and sends the first protocol trans- 
mission through said routing means to one of said first 
ports to be forwarded thereby to an external network 
device, 

wherein each first protocol transmission includes a prior- 
ity data value assigning a priority to said first protocol 
transmission, and 

wherein said routing means routes first protocol transmis- 
sions to any of said first ports and to said first conver- 
sion means with a priority determined by the priority 
data values included in the first protocol transmissions. 

14. The network routing switch in accordance with claim 
11 wherein each first protocol transmission includes a pri- 
ority data value assigning a priority to said first protocol 
transmission, and 

wherein each of said first ports also stores cells of 
received first protocol transmissions according to the 
priority assigned to the transmission. 

15. A network routing switch for receiving differing first 
and second protocol transmissions from external network 
devices, for converting first protocol transmissions to second 
protocol transmissions, for converting second protocol 
transmissions into first protocol transmissions, and for for- 
warding received and converted first and second protocol 
transmissions to external network devices, the network 
switch comprising: 

a set of first ports, each for receiving first protocol 
transmissions from an external network device and for 
forwarding first protocol transmissions to said external 
network device; 

a set of second ports, each for receiving second protocol 
transmissions from an external device and for forward- 



forward to an external network device as a second 
protocol transmission, and sends the stored second 
protocol transmission through said routing means to 
said second conversion means when the stored second 
protocol transmission is to be converted to a first 
protocol transmission before being forwarded to an 
external network device, 
wherein the first conversion unit converts each first pro- 
tocol transmission forwarded thereto into a second 
protocol transmission and sends the second protocol 
transmission through said routing means to one of said 
second ports to be forwarded thereby to an external 
network device, 
wherein the second conversion unit converts each second 
protocol transmission forwarded thereto into a first 
protocol transmission and sends the first protocol trans- 
mission through said routing means to one of said first 
ports to be forwarded thereby to an external network 
device, 

wherein each said first protocol transmission received by 
a first port comprises a plurality of sequentially trans- 
mitted cells, each cell including channel data referenc- 
ing a virtual channel, and 
wherein each said second protocol transmissions consists 
of a single packet, said packet including a data payload 
and a network destination address of a network device 
to receive the packet. 

16. The network routing switch in accordance with claim 
30 15 wherein each second port determines from the network 

destination address included in each packet received from an 
external network device whether to send said packet through 
said routing means to another of said second ports or to said 
second conversion means. 

17. A network routing switch for receiving differing first 
and second protocol transmissions from external network 
devices, for converting first protocol transmissions to second 
protocol transmissions, for converting second protocol 
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35 



transmissions into first protocol transmissions, and for for- 

FnT^d~prato^f NM^Tto ~sa7d 'external « waldin S received and converted first and second protocol 

transmissions to external network devices, the network 
switch comprising: 

a set of first ports, each for receiving first protocol 
transmissions from an external network device and for 



network device; 

first conversion means for converting a first protocol 
transmission into a second protocol data transmission; 

second conversion means for converting a second proto- 
col transmission into a first protocol transmission; and 

routing means for selectively interconnecting said first 
and second ports and said first and second conversion 
means for conveying transmissions therebetween; 

wherein each first port receiving a first protocol transmis- 
sion from an external network device stores the first 
protocol transmission and then sends the stored first 
protocol transmission through said routing means to 
another of said first ports when the first protocol 
transmission is to be forwarded to an external network 
device as a first protocol transmission, and sends a 
stored first protocol transmission through said routing 
means to said first conversion means when the stored 
first protocol transmission is to be converted to a 
second protocol transmission before being forwarded $o 
to an external network device, 

wherein each second port receiving a second protocol 
transmission from an external network device stores the 
second protocol transmission until it is complete and 
then sends the stored second protocol transmission 65 
through said routing means to another of said second 
ports when the second protocol transmission is to be 
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forwarding first protocol transmissions to said external 
network device; 

a set of second ports, each for receiving second protocol 
transmissions from an external device and for forward- 
ing second protocol transmissions to said external 
network device; 

first conversion means for converting a first protocol 
transmission into a second protocol data transmission; 

second conversion means for converting a second proto- 
col transmission into a first protocol transmission; and 

routing means for selectively interconnecting said first 
and second ports and said first and second conversion 
means for conveying transmissions therebetween; 

wherein each first port receiving a first protocol transmis- 
sion from an external network device stores the first 
protocol transmission and then sends the stored first 
protocol transmission through said routing means to 
another of said first ports when the first protocol 
transmission is to be forwarded to an external network 
device as a first protocol transmission, and sends a 
stored first protocol transmission through said routing 
means to said first conversion means when the stored 
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first protocol transmission is to be converted to a wherein each of said first ports stores cells of received first 

second protocol transmission before being forwarded protocol transmissions according to the virtual channel 

to an external network device, referenced by their virtual channel data, 

wherein each second port receiving a second protocol wherein each of said first ports send a stored first protocol 

transmission from an external network device stores the 5 transmission through said routing means only after 

second protocol transmission until it is complete and receiving and storing said last cell of said first protocol 

then sends the stored second protocol transmission transmission, an 

through said routing means to another of said second wherein each said first port includes means for 

ports when the second protocol transmission is to be determining, from the channel data included in the cells 

forward to an external network device as a second w of each first protocol transmission received from an 

protocol transmission, and sends the stored second external network device > whether t0 send said first 

protocol transmission through said routing means to protocol transmission through said routing means to 

said second conversion means when the stored second another of d first ports or to said first conversion means, 

protocol transmission is to be converted to a first 18 ^ network routin g switch "> accordance with claim 

protocol transmission before being forwarded to an « 17 wherein each first protocol transmission mcludes a pn- 

external network device, " ont y data value assigning a priority to said first protocol 

t • « * . ^ , - . transmission, and 

wherein the first conversion unit converts each first pro- . . ' _ t . 

tocol transmission forwarded thereto into a second whereul said ,™ tm S means ^ * ret P rotoco1 : transmis- 

protocol transmission and sends the second protocol slons ;° s f ** ports and said first conversion means 

transmission through said routing means to one of said 20 m ° rder ° f P™"* indicated by the priority data values 

second ports to be forwarded thereby to an external 10 m ^ Uded m ^ pTOiOC °} ir ™ m ™ 10ns - . fc , . 

network device The network routing switch in accordance with claim 

' 18 wherein each of said first ports also stores cells of 

wherein the second conversion unit converts each second reccived fifSt ^ traasmissions in groups linked 

protocol transmission forwarded thereto into a first 25 according to the priority to lhe transmission, 

protocol transmission and sends the first protocol trans- 20 ^ network routing switch m accord ance with claim 

mission through said routing means to one of said first 19 wherein each ^ second pTOiocol transmission consists 

ports to be forwarded thereby to an external network of a single packet> said packet including a data pay i oad and 

device, a ne t work d estination address of a network device to receive 

wherein each said first protocol transmission received by 30 the packet. 

a first port comprises a plurality of sequentially trans- 21. The network routing switch in accordance with claim 

mitted cells, each cell including channel data referenc- 20 wherein each second port determines from the network 

ing a virtual channel, destination address included in each packet received from an 

wherein a last cell of each first protocol transmission external network device whether to route said packet 

includes an end of message code identifying said last 35 through said routing means to another of said second ports 

cell as such, wherein at least one of said first ports or to said second conversion means, 
receives cells of separate first protocol transmission 

from external network sources in interleaved fashion, * * * * * 
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PACKET SWITCH AND SWITCHING In the case of the above related art, only one type of the 

METHOD FOR SWITCHING VARIABLE mapping table is referred for queuing packets. The trans- 

LENGTH PACKETS mission priority and the discard priority of each packet are 

defined in the mapping tabic corresponding to the priority 

BACKGROUND OF THE INVENTION 5 information and the protocol information included in the 

(1) Field of the Invention ' " " ^ ~ h " der Consequently in the case 
1/ ... . , of the related art, it is difficult to control a bandwidth for 
Represent invention relates to a packet switch and a each ^ flow ^ tQ ide ^ ke{ sM 

switchmg method more particularly to a packet switch and scfvices d di on> for fe lhe £ m(x or destina ! 

a switching method for switching variable length packets by J0 , ion networkof those packetsia a node apparatus connecting 

controlling transmission of packets according to the priority , ^ of n6 , works ^ ^ ^ 
of respective packets. More concretely, the present invention 

relates to a packet switch and a switching method suitable SUMMARY OF THE INVENTION 

for switching variable length packets in the Internet, as well . . . . , . 

as intranets, and capable for transmitting the variable length , , 1 « ° b ^ ect °f * e mvea{ >™ to P r0Vlde a P ac * 6t 

packets by controlling a bandwidth thereof. 15 \ ^ met ?° d ?" C ° mr ° 1 * 

(2) Description of the Related Art ba " dw,dth ° f ea ? h ° f Tt S P 

~v , \ ... . . . , It is another object of the present invention to provide a 

The packet communication is a communication method , A . . J , . * ... . . \ t 

e 4 j - J4 - c e i • packet switch and a packet switching method that can 

for transmitting and receiving data in a form of a packet v . . « « . . . & , r , e 

, . *ruj a a * guarantee the bandwidth previously reserved for each of 

comprising a predetermined format of header and a data 1Q B . , . . . t a p , 3 . . 

li f u* • j u j- j- > • • * . • 1 1 * M vanable length packet flows by a communication service 

block obtained by dividing transmission data into blocks . * 

contract 

each having a proper length, for example, 48 byte to 1.5 . 

Kbytes in length. A packet switch adopts a so-called stored 11 15 stJ1 another ob ^ { of the P resent invention to provide 

and forward switching method, in which switching of com- a Packet switch and a packet switching method that can 

munication data is carried out after the communication data 25 P erfonn both bandwidth control and priority control for each 

as stored temporarily in the switch, so that it is able to apply vanable len g th P acket flow ' 

a variety of control to the packets passing through the 11 is further another object of the present invention to 

switch. provide a packet switch and a packet switching method that 

In the case of the Internet coming into wide use rapidly in can P erform 0010 bandwidth control and priority control 

recent years, data is communicated as variable length pack- 30 accordin g t0 the communication protocol of variable length 

ets with an IP (Internet Protocol) header respectively. packets. 

Therefore, a packet switching technique is essential to each In order to achieve the above objects, the packet switch of 
node for connecting a network to another. An inter-networks the present invention comprises a plurality of input port 
connecting apparatus (node apparatus) such as a router is interfaces, a plurality of output port interfaces, and a packet 
also provided with the packet switching function even if it 35 switching unit for switching each of variable length packets 
is not called a "packet switch" actually. Consequently, in the received from the input port interfaces to one of the output 
present invention, every network apparatus provided with port interfaces corresponding to the destination address of 
the packet switching function, including the node apparatus the packet, wherein each of the output port interfaces 
having such a specific name as router, will be referred to as comprises a buffer memory for storing transmission packets, 
a packet switch. 40 a transmission priority controller configured so as to classify 
As a prior art related to such the packet switching, a router the transmission packets received from the packet switching 
is disclosed in Japanese Unexamined Patent Publication unit based on a predetermined algorithm and queue each of 
No.7-135512. The prior art intends to provide a router with those transmission packets according to transmission prior- 
a function for controlling the transfer of received packets thereof into one of a plurality of queue groups, each of 
according to the priority thereof and a function for control- 45 which is assigned an individual bandwidth, a transmission 
ling the discard of packets performed when a buffer con- Packet read-out controller for accessing the queue groups of 
gestion occurs. According to the prior art, packets received ln e buffer memory cyclically to read out transmission pack- 
by a packet receiving unit are queued in a buffer memory by els from eacn of those queue groups according to the order 
a priority control unit according to the transmission priority of descending transmission priority while guaranteeing the 
and the discard priority of each packet, then they are 50 bandwidth assigned to each queue group, and a packet 
transmitted to a packet transmission unit according to the transmission circuit for transmitting the transmission pack- 
order of the transmission priority. The priority control unit ets rea d out by the transmission packet read-out controller to 
refers to a mapping table based on the transmission priority an output port associated with the output port interface, 
information and the protocol information included in the According to an embodiment of the present invention, the 
header of each received packet to obtain the processing 55 transmission priority controller is, for example, provided 
priority and the discard priority corresponding to the with means for identifying communication service contract 
received packet, and stores the packet in one of queues and transmission priority related to each of transmission 
prepared corresponding to the discard priority for each packets according to the header information of the trans- 
processing priority. Stored packets are output sequentially mission packet received from the packet switching unit, 
according to the order of descending processing priority. 60 thereby to queue the transmission packet in a queue corre- 
When the free area capacity of the buffer memory goes sponding to the identified transmission priority in a queue 
under a predetermined threshold value, discard control of the group corresponding to the identified communication ser- 
stored packets is carried out to avoid congestion. In this vice contract. 

discard control, packets with higher discard priority are More concretely, the transmission priority controller 

discarded from queues according to the order of ascending 65 includes, for example, a first management table for defining 

processing priority until the free area capacity of the buffer the identifier of the queue group corresponding to the 

memory reaches a target threshold value. combination of a source network address and a destination 
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network address and a second management table for defin- packets temporarily, a relaying priority controller configured 
ing a transmission priority corresponding to the combination so as to filter the packets received from input ports based on 
of a communication protocol and a packet priority, thereby the destination address of the received packets, classify the 
to identify a queue group corresponding to each of the filtered received packets according to their priority based on 
transmission packets by referring to the first management 5 a predetermined algorithm, and queue those classified pack- 
table based on the source network address and the destina- ets in the receiving buffer memory according to the priority, 
tion network address included in the header of the transmis- and a received packet read-out circuit for reading out the 
sion packet, and to identify the transmission priority of the received packets from the receiving buffer memory accord- 
transmission packet by referring to the second management ing to the order of priority to supply those packets to the 
table based on the communication protocol information and 10 packet switching unit. By transferring the packets according 
the packet priority information included in the header of the to the order of priority in both of the input port interfaces and 
transmission packet. the output port interfaces, it is possible to reduce the delay 
According to another embodiment of the present time of m "gh priority packets within the packet switch more 
invention, the transmission priority controller identifies a significantly. 

queue group corresponding to each of the transmission 15 The switching method of the present invention for switch- 
packets based on the protocol information included in the ing variable length packets comprises the steps of: (a) 
header of the transmission packet, then queues the trans- filtering received packets received from an input port based 
mission packet in one of the queues in the identified queue on the destination address of each of those packets; (b) 
group. In this case, for example, the object of switching is transferring the filtered packets to one of the output ports 
an IP packet which is identified by the protocol of the 20 interfaces as transmission packets according to the destina- 
network layer of the OSI (Open Systems Interconnection) tion address included in the header of each of those packets; 
reference model, and transmission packets are classified into (c) classifying the transmission packets based on a prede- 
a plurality of queue groups according to the protocol type of termined algorithm and queuing the classified packets 
the transport layer of the OSI reference model, for example, according to transmission priority into one of the a plurality 
TCP (Transmission Control Protocol), UDP (User Datagram 25 of queue groups assigned with a individual bandwidth 
Protocol), I CMP (Internet Control Message Protocol), or respectively; (d) accessing the plural queue groups cycli- 
IGMP (Internet Group Management Protocol). cally to read out transmission packets from each of those 

In order to guarantee the bandwidth of each queue group, queues according to the order of transmission priority while 
the transmission packet read-out controller of the present guaranteeing the bandwidth assigned to each of the queue 
invention specifies, for example, a queue group among the 30 groups; and (e) transmitting the read out transmission pack- 
queue groups cyclically to read out transmission packets, ets t0 40 out P ut P° re- 
assigns a packet read duration to the specified queue group More specifically, the step (b) of transferring received 
depending on the bandwidth thereof, and reads out trans- packets preferably includes a step (bl) of queuing received 
mission packets according to the order of descending trans- packets according to priority in the receiving buffer and a 
mission priority. 35 step (b2) of reading out the received packets stored in the 

Furthermore, according to an embodiment of the present receiving buffer according to the order of priority and 

invention, the transmission packet read-out controller transferring those packets as transmission packets to one of 

includes a management table for defining a threshold value the output interfaces. 

in proportion to the bandwidth corresponding to each of the In the step (c) for queuing transmission packets, for 

queue groups and a control means for accessing the plurality example, transmission packets are classified by communi- 

of queue groups cyclically to read out transmission packets cation service contract related to each of those transmission 

continuously from each of the queue groups, until the total packets and queued in a queue group corresponding to the 

length of the read out packets exceeds the threshold value communication service contract by transmission priority, 

defined in the management table and for storing the surplus 45 Instead of this, transmission packets may be classified by 

length of the read out packet exceeding the threshold value communication protocol related to each of those transmis- 

as an initial value for use in the counting of the total packet sion packets, so that those transmission packets are queued 

length in the next read cycle. according to transmission priority in a queue group corre- 

The controlling means, for example, checks the initial sponding to the protocol thereof in the step (c). 

value for counting the total packet length associated with 50 

one of the queue groups before reading out transmission BRIEF DESCRIPTION OF THE DRAWINGS 

packets from the queue group. If the initial value exceeds its FI(J j b „ Wock ^ ^ an embodimem 0 f the 

threshold value, the controlling means subtracts the thresh- kef swilch Qf me m invention 

old value from the initial value and switches the object of ^ . . „ , . . , , 

packet read out to a next queue group without reading out 5S FI< ?- 2 i is a configuration of a network to which the packet 

any packets from the current queue group. Consequently, switch of lhe inveDtl0Q a PP hes - 

variable length packets are read out to an output port, while FIG - 3 * a format of packets to be processed by the packet 

the bandwidth is controlled properly in a long-ranged view. switch of the present invention. 

The algorithm provided for the transmission priority FIG. 4 is an example of a mapping table 40 for defining 

controller to classify transmission packets, as well as the 60 a transmission priority corresponding to both protocol and 

bandwidth control function provided for the transmission priority of each packet. 

packet read-out controller can be modified suitably, for FIG. 5 is an example of a mapping table 50 for defining 

example, in response to a control command issued from an a bandwidth according to both source and destination 

external network management terminal. addresses of each packet. 

Another feature of the packet switch of the present 65 FIG. 6 is a drawing for explaining the layout of queues 

invention resides in that each of the input port interfaces formed according to transmission priority in a buffer area 

comprises a receiving buffer memory for storing received 83-i prepared for each bandwidth. 
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FIG. 7 is an example of a management table provided for The transmission priority control unit 5 stores packets 

a packet read-out circuit 81 shown in FIG. 1. received from the relaying unit 4 in a transmission buffer 

FIG. 8 is a flow chart showing the functions of the packet memory 83 (83-1 to 83-j) as transmission packets, 

read-out circuit 81. A feature of the present invention resides in that the 

FIG. 9 is a drawing for explaining the order of packets 5 transmission priority control unit 5 forms a multi- 
read out by the packet read-out circuit 81. dimensional queue structure in the transmission buffer 83 by 

FIG. 10 is a block diagram showing another embodiment classifying transmission packets into a plurality of groups 

of the packet read-out circuit 82-1 shown in FIG. 1. base on lhe header ^formation of each of those packets and 

queuing those transmission packets according to the trans- 

DESCRIPTION OF THE PREFERRED io mission priority for each group. 

EMBODIMENTS \ n t h e illustrated embodiment, the transmission buffer 83 

At first, the configuration and operation of the packet is divided into a plurality of buffer areas, each of which 

switch of the present invention will be described briefly with includes a plurality of queues (Q10 to Qln) . . . (QjO to Qjn), 

reference to FIG. 1. s° that a total of j queue groups 83-1 to 83-j are formed. The 

The packet switch I comprises a plurality of control 15 transmission priority control unit 5, as will be described 
boards (10-1 to 10-L) prepared for each of input/output later > refers to the mapping tables 40 and 50, which will be 
ports, a packet relaying unit (packet switching unit) for described later with reference to FIGS. 4 and 5, based on the 
relaying packets between those control boards 10, and a CODtro1 information (e.g., packet priority information, pro- 
management unit 9 connected to each of the control boards , n toco1 information, network address, etc. as will be described 
10. Each of the control boards 10 comprises an input port later in detail ™ ih reference to FIG. 3) included in the 
interface 20 and an output port interface 21. header of each P acket > thereD y determining the transmission 

Hie input port interface 20 includes a packet receiving f™?* and the 1 ueue V™* of the transmission packet, 
unit 2 for receiving packets flowing on a network from an Each °* the queue groups as will be described later, is 
input port IN, a relaying priority control unit 3 for storing the 25 assi g. ned W1 * « ^dividual bandwidth and each of trans- 
packets received from the packet receiving unit in a receiv- misslon P ackets 15 stored m tne ? ueue 10 one of f hose 
ing buffer memory 72 according to the priority, and a 1 ueue S^P 8 corresponding to the transmission priority, 
received packet queuing unit 7 for transferring the packets ^ transmission packet queuing unit 8 mcludes first 
stored in the receiving buffer memory 72 to the packet stage read-out circuits 82 (82-1 to 82-j) for reading out 
relaying unit 4 according to the priority. * 30 packets according to the transmission priority from each 

The output port interface 21 includes a transmission ^ eue g™ u P < buffer « a and a next stage read-out 
priority control unit 5 for storing the packets received from circuit 81 for outputting the packets read out by those 
the relaying unit 4 selectively in one of a plurality of queue first-stage read-out circuits to the transmission unit 6 accord- 
groups formed in a transmission buffer 83 (83-1 to 83-j), a m S t0 the bandwidth assigned to each queue group, 
transmission packet queuing unit 8 for reading out the 35 The packet transmission unit 6 transmits packets received 
packets stored in the transmission buffer 83 according to the from the read-out circuit 81 to an output port OUT. The 
transmission priority while guaranteeing the bandwidth output port identifier added by the relaying priority control 
assigned to each queue group, and a packet transmission unit umt 3 to each received packet is removed by the transmis- 
6 for transmitting the transmission packets read out by the s i° n priority control unit 5 or the packet transmission unit 6. 
transmission packet queuing unit 8 to an output port OUT 40 Grouping of transmission packets performed by the trans- 
connected to a network. mission priority control unit 5 is made, to put in the concrete, 

The relaying priority control unit 3 is provided with a for each communication service contract between a line 
filtering function of referencing to a routing table based on provider and a network user or customer user group. The 
the destination address included in the header of each read-out circuit 81 reads out transmission packets sequen- 
received packet to determine whether the transmission 45 tialr y from each of the queue groups so as to adapt to the 
packet should be relayed to another network or not. If it is communication service level promised by each contract, 
determined that the transmission packet should be relayed to A communication service contract mentioned here means 
another network, the packet is given an output port identifier, a bandwidth guarantee contract exchanged between a net- 
then queued in the receiving buffer memory 72 according to work user and a network line provider (or an Internet 
the priority. More concretely, the relaying priority control 50 communication provider). More concretely, it means a con- 
unit 3 determines a relaying priority based on the manage- tract between a business company or a public agency and a 
ment information (packet priority information, protocol line provider. According to such a contract, a communica- 
information, network address, etc., which will be described tion service level including such items as communication 
later in detail with reference to FIG. 3) and stores the quality (e.g., packet delay time, delay variability, packet 
received packet in a queue Qi corresponding to the deter- 55 discard rate, etc.), an assigned bandwidth, etc. is guaranteed 
mined relaying priority. The queue Qi is one of a plurality of for each user. Each of those network line providers is thus 
packet queues QI to Qn formed by relaying priority in the responsible for the contracted communication service level 
receiving buffer memory 72. with respect to each packet flow on the subject network. 

In the received packet queuing unit 7, packets are read out A network manager can change the packet queuing algo- 

by the packet read-out circuit 71 sequentially from the 60 rithm in the transmission priority control unit 5 and the 

queues formed in the receiving buffer memory 72 according read-out algorithm in the read-out circuits 81 and 82 via the 

to the order of descending priority, then output to the packet management unit 9 by issuing a command, for example, 

relaying unit 4. The relaying unit 4 comprises, for example, from the management terminal PT shown with a reference 

a bus or crossbar switch for switching the packets received numeral 16Cn in FIG. 2. 

from the receiving packet queuing unit 7 provided in each 65 The packet queuing algorithm used by the transmission 

control board 10 to one of the other control boards identified priority control unit 5 can be changed by updating the 

by the output port identifier added to the packet. contents in the mapping tables 40 and 50 and the packet 



06/11/2004, EAST Version: 1.4.1 



US 6,570,876 Bl 

7 8 

read-out algorithm used by the read-out circuit 81 can be for each of those packet flows or according to the protocol 

changed by updating the contents in the management table and the packet priority of each packet flow. 

60 as will be described later. The second form of usage allows only one contract, for 

Next, a configuration of the packet switch of the present example, the packet flow of the contract A to use the full 

invention, as well as how to use the packet switch will be 5 bandwidth of the line 14C through the head-of-line sched- 

described with reference to FIG. 2. In ^ s * ase ' m 5 guaranteed bandwidth is set as 0 for 

_ , , _ , all the packet flows of other contracts except for the contract 

The network shown in FIG. 2 comprises sub-networks ^ md packet switching for these contracts are carried out 

15An, 15Bn, 15Am, and 15Bm, and a backbone network using the residual bandwidth of the contract A. According to 

15Cn for connecting those sub-networks to each other. Each m is second form of usage, the user of a contract B can make 

of the sub-networks 15An to 15Bm is connected to packet a communication with a less cost by using the residual 

communication terminals 16 An, 16Bn, 16 Am, and 16Bm bandwidth of the contract A, although the communication 

respectively. quality is not guaranteed for the contract B. If there are a 

Each of the sub-networks 15An to 15Bm is provided with plurality of packet flows for the same contract, packet 

packet switches lAn, IBn, lAm, and lBm. The packet switching is carried out equally to each of the packet flows 

switches lAn and IBn are connected to a packet switches or according to the protocol and the priority of each packet 

lCn in the network 15Cn through the lines 14An and 14Bn flow ^ like m the 6x51 form of u^S*- 

respectively. The packet switches lAm and lBm are con- None of the conventional packet switches is provided 

nected to a packet switch lCm in the network 15Cn through with such tbe ^ c ^ oa for guaranteeing communication 

the lines 14Am and 14Bm respectively. „ n services for respective contracts stated above and the func- 

. . . , . i i ' * a «u i * lion for controlling transmission of a plurality of packet 

In the backbone network 15Cn are connected the packet flows ^ ^ QMt ^ing^o the ^ ^ 

switches lCn and lCm to each other through a hue 14C. respect tQ sucfa y&dMQ leQgth Ip p £ ckels ' 

The packet switches 1 of the present mvenUon shown in FIG. 3 shows a format of the packets handled by the 

HG. 1 are used as packet switches lCn and lCm of the ket switcfa of ^ m invention> 

backbone network. 16Cn indicates a management terminal 25 A „ n * f ™ ■ n . . lt . o Antn 

j. . r.. . i a j- A packet 30 comprises a header portion 31 and a data 

operated by the manager of the network 15Cn. According to r ~~ ~, . K . , , , . . 

/ 3 , . ,% . . . . ^ J% portion 32. The header portion 31 mcludes packet priority 

the commands issued from this management terminal 16Cn, f c t ^ • t i A * » ■ e *• 

. 4 . . - *• u u • .u i 4 •* u a mformation 33, a service request 34, protocol information 

the control information held in the packet switches lCn and ' ~, ? . . \ . ^_ ^ 

i r> .i_ ♦ • .i_ , t . tL 4 u m en * en u 35, a source address 36, and a destmation address 37. The 

1 Cm, that is, the contents m the tables 40, 50, and 60 can be , . . . 4 . c . . 

^ a j e( j ' 30 P ac ^ et P nont y information 33 indicates the processing pn- 

U P a e ority of the packet. The service request 34 indicates the 

Generally, a packet communication can perform a multi- information of a service requested by the packet, for 

plex communication, wherein a bandwidth indicating a example, a request for reliability and high speed perfor- 

communication capability of a line per unit time is divided mance 7^ items m equivalent to the priority (0 to 2nd 

to assign the bandwidth to a plurality of users. bits ) ? lhe low delay request ( 3rd bii ^ the ^ gh through-put 

In the case of the network shown in FIG. 2, the commu- 3 request (4th bit), and the high reliability information (5th bit) 

nication line 14C of the backbone network is shared by the of the TOS (Type Of Service) field defined in the header of 

communication between the sub-networks 15 An and 15 Am each IP packet used for the Internet, 

and the communication between the sub-networks 15Bn and The protocol information 35 indicates the type of the 

15Bm. Packet communication allows each line to be used communication protocol used for the packet, that is, the type 

effectively through the statistical multiplexing effect by D f the packet. If the packet 30 is an IP packet, the protocol 

sharing a line among a plurality of communications such information 35 identifies the type of the transport layer 

way and by assigning a bandwidth to each of those com- header 39 following the header 31. For the most frequently 

munications only when it is needed actually. use d protocols, TCP (Transmission Control Protocol), UDP 

In this case, it is assumed that the communication between 45 (User Datagram Protocol), ICMP(Internet Control Message 

the sub-networks 15An and 15Am is executed under the Protocol), and IGMP (Internet Group Management 

contract A and the communication between the sub- Protocol) are well known. 

networks 15Bn and 15Bm is executed under the contract B, The source address 36 indicates the address of a source 

and the network provider who owns the backbone network device from which the packet is transmitted. The destination 

15C provides packet flows on the line 14C with packet 5Q address 37 indicates the address of a destination device to 

switching services according to the service levels guaranteed which the packet is forwarded. These addresses correspond 

by the contracts A and B. This example is equivalent to a to the SA (Source IP Address) and the DA (Destination IP 

case in which the sub-networks 15 An and 15 Am are, for Address) in a case of IP address system, 

example, of the head office and a branch office of a company Th e IP address comprises an IP network address and an IP 

A and the sub-network 15Bn and 15Bm are of the head office 55 host address. The IP network address identifies a sub- 

and a branch office of a company B, and each of those network connected to a source or destination terminal and 

companies A and B perform their intranet communications tne jp nosl address identifies the source or destination 

according to the contracts with the Internet communication terminal. The communication service contract described 

provider C. above can be identified with the combination of such a 

The packet switch of the present invention provides the 60 source IP network address and a destination IP network 

following two forms of usage for a plurality of contract address. 

parties who share the line 14C. Next, description will be made for the mapping tables 40 

The first form of usage distributes the bandwidth of the and 50 referred by the transmission priority control unit 5 to 

line 14C to both contracts A and B equally or with a weight classify transmission packets. 

according to the contract contents respectively, through the 65 FIG. 4 shows an example of the mapping table 40 referred 

rate based scheduling. If there are a plurality of packet flows to determine transmission priorities among a plurality of 

for the same contract, packet switching is carried out equally packet flows transmitted under respective contracts. 
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The mapping table 40 is a two dimensional matrix 83-1. For each combination of a source network address and 

arranging, for example, the protocol 35 included in each a destination network address, which is not included in any 

packet header 31 as column information 41 and the priority specific contract, a null indicated with a symbol is set. 

information 33 included in the packet header 31 as row Packets having the address information corresponding any 

information 42, wherein a value is assigned to indicate the 5 of such combinations are discarded by the transmission 

transmission priority corresponding to each combination of priority control unit 5. Discarding of such packets that are 

the priority information 33 and the protocol 35. Each value communicated between non-contracted networks may con- 

(0 to 7) of the transmission priority defined in the mapping tribute to improve the security of the packet communication, 

table 40 also indicates the value of a second index k of the The transmission priority control unit 5 refers to the two 

queue Qik formed in the transmission buffer area 83-i shown 10 types of mapping tables 40 and 50 as described above in 

in FIG. 1. order to queue transmission packets in the transmission 

The type of the protocol 35 may be classified into, for buffer 83. Consequently, a plurality of queue groups 83-1 to 

example, TCP and UDP according to the transport layer 83-j are formed in the transmission buffer 83. Each of those 

protocol of the OSI reference model. It may also be classi- queue groups 83-1 to 83-j has an identification number 

fied according to a layer upper than the transport layer, for 35 defined in the mapping table 50 as shown in FIG. 1. And, in 

example, into HTTP (Hyper Text Transfer Protocol), FTP each of those queue groups are formed a plurality of queues 

(File Transfer Protocol), TELNET (Terminal Connection), corresponding to a transmission priority defined in the 

SMTP (Simple Mail Transfer Protocol), and etc. depending mapping table 40 respectively as shown in FIG. 6. 

on the port number if the protocol type is TCP. Transmission packets stored in the queue groups 83-1 to 

Generally, in the case of an interactive type or connection 20 83-j are read out by the packet read -out circuits 82-1 to 82-j 

oriented network protocol such as TCP, the source terminal according to the transmission priority for each queue group, 

transmits a next packet after receiving an ACK packet from then transmitted to the packet transmission unit 6 by the 

the destination terminal. On the contrary, in the case of a packet read-out circuit 81, while the contracted bandwidth of 

batch type or connectionless network protocol such as UDP, each queue group is guaranteed. 

the source terminal can transmit packets one after another 25 Hereunder, the operation of the packet read -out circuit 81 

without waiting for an ACK packet from the destination will be described with reference to FIGS. 7 and 8. 

terminal. Consequently, if a higher transmission priority is FIG. 7 shows a configuration of the management table 60 

given to packet flows of the interactive type or connection which is referred by the packet read-out circuit 81 to control 

oriented network protocol and a lower priority is given to bandwidths. The management table 60 comprises a plurality 

packet flows of the batch type or connectionless network 30 of records corresponding to the queue groups 83-1 to 83-j. 

protocol in the mapping table 40, it is able to improve the Each of those records comprises a queue group identification 

total through-put of the packet switch 1. number 61, a bandwidth 62 assigned to each queue group, 

FIG. 5 shows a configuration of the mapping table 50 and a counter area 63 for counting the length of transmission 

referred by the transmission priority control unit 5 to iden- 35 data read out from each queue group, 

tify the contract associated with each of the transmission FIG. 8 shows a flow chart for a control operation per- 

packets and to specify a queue group corresponding to the formed by the packet read -out circuit 81. 

contract from among the queue groups 83-1 to 83-j shown Tn e packet read-out circuit 81 initializes count values in 

in FIG. 1. the counter area 63 of the management table 60 and the 

The mapping table 50 is a two-dimensional matrix arrang- 40 value of a parameter p to zero respectively (steps 102 and 

ing the source network address included in the source 104), then increments the value of the parameter p (step 

address 36 of each packet header 31 as column information 106). After this, the packet read-out circuit 81 compares the 

51 and the destination network address included in the value of CNT(p) of the counter 63 with the value of BW(p) 

destination address 37 of the packet header 31 as row of the bandwidth 62 in the p-th record of the management 

information 52, wherein an identification number (contract 45 table 60. If the CNT(p) is equal to or more than the BW(p), 

identifier) of queue group is assigned corresponding to each the packet read-out circuit 81 subtracts the value of BW(p) 

combination of a source network address and a destination from the value of CNT(p) (step 120), then compares the 

network address. parameter p with the maximum value j (step 124). If the 

The reason why transmission packets are buffered in value of the parameter p is less than the maximum value j, 

queue groups by contract in the present invention is that 50 tne control sequence returns to step 106, otherwise returns to 

reading of packets from the transmission buffer 83 is con- step 106 after clearing the value of parameter p to zero, 

trolled so as to provide each transmission packet flow with If the CNT(p) value is less than the BW(p) value in step 

a contracted bandwidth. In the embodiment shown in FIG. 108, the packet read-out circuit 81 reads out a packet from 

5, a contract is identified by the combination of a source the p-th queue group QG(p) of the output buffer 83 and 

network address and a destination network address, and the 55 transmits the packet to the packet transmission unit 6 (step 

identification number of a queue group corresponding to 110). If there is no output packet to be read out in the queue 

each contract is defined in the mapping table 50. Each group QG(p)(step 112), the packet read-out circuit 81 clears 

contracted bandwidth is shown in parentheses for reference the counter CNT(p) value to zero (step 122), then performs 

in the mapping table 50. the processing of step 124. When the packet read-out circuit 

A queue group identification number as described above 60 81 rcads out a packet from the queue group QG(p), the 

is assumed as the value of the first index i of the queue Oik packet read-out circuit 81 adds the packet length to the 

formed in the transmission buffer area 83-i. In the illustrated counter CNT(p) (step 114), then returns to step 108. 

example, the communication between a terminal connected FIG. 9 shows an explanatory view of a sequence of 

to the sub-network Am and a terminal connected to the packets read out through the read control operation 

sub-network An is carried out under the contract A to which 65 described above. 

the bandwidth BW1 is assigned. It will thus be understood It is assumed here that packets are read out from three 

that packets for this contract A are queued in the queue group queue groups QG(1), QG(2), and QG(3) to which band- 
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widths BW(1), BW(2), and BW(3) are assigned, According to the packet read-out control of the present 

respectively, for simplifying the description. The rate of the invention, when a bandwidth guaranteed contractor stops 

above bandwidths is assumed to be BW(1):BW(2):BW(3)= communications, the bandwidth for the contractor is 

3:2:1. released to the users of the residual bandwidth BWj. It is 

According to the control flow chart shown in FIG. 8, the 5 assumed that, as an exaggerated example, a specific user X 

value of parameter p is changed so that each queue group makes a contract for guaranteeing the exclusive use of the 

can get a packet read cycle cyclically. Packets are read out m11 bandwidth of the output line 14C and the residual 

from each queue group continuously until the total length of bandwidth of the user X is assigned to the other users. In this 

read packets exceeds the value of a contracted bandwidth case » b Y modifying the control flow chart shown in FIG. 8, 

BW(p) registered in the management table 60. When the * 0 il k also a *>le to realize such a packet read-out control of the 

total length of the read packets exceeds the value of the head-of-line scheduling that release the full bandwidth of the 

contracted bandwidth BW(p), the packet read-out circuit 81 line 14C to the other users until the next packet flow of the 

subtracts the contracted bandwidth BW(p) value from the user * arrives whenever the packet flow of the user X stops, 

counter CNT(p) value, then transfers the packet read cycle In the case of such a h ead-of-the-line scheduling 

to next queue group. 15 sequence, packets are basically read out repetitively from a 

Consequently, as shown in FIG. 9, when packets PI, P2, queue group of the user X. If there is no packet to be read 

and P3 are read out from the first queue group QG(1), the out in the queue group of the user X, one of the other queue 

read cycle is transferred to the second queue group QG(2). groups is accessed. In this case, packets may be read out one 

At this time, the counter CNT(1) value of the queue group after another by checking whether there is a new packet 

QG(1) indicates the length of the residual part of the packet 20 arrived in the queue group of the user X each time one 

P3 shown with a dotted line. packet is read out. According to the present invention, 

When packets P4 and P5 are read out from the second therefore, it is possible to realize not only the first usage 
queue group QG(2), the read cycle is transferred to the third form, but also the second usage form, thereby providing 
queue group QG(3). Since the contracted bandwidth of the 25 communication services appropriately to both of the band- 
third queue group QG(3) is small, the read cycle is trans- width contractor and other non-contractors, 
ferred to the first queue group QG(1) when one packet P6 is ^ ^ ^ ^ of packets to be 
read out in this example. read QUt from each queue groupj for examplej the order of 

When packets P7 and P8 are read out from the first queue ^e packets pi p 2> P3> P7, ... in the first queue group 

group QG(1), the read cycle is transferred to the second 30 qg(1). In other words, the contracted bandwidth BW(i) 

queue group QG(2). In this example, since a long packet P5 assigned to each queue group is further divided by the 

was read out from the second queue group QG(2) in the rea d-out circuit 82 so as to be assigned to a plurality of 

previous read cycle, the value of the counter CNT(2) queues Qi l0 Qin in the queue group 
exceeds the value of the contracted bandwidth BW(2). 

Consequently, in this read cycle, no packet is read out from 35 I fl tms case » now t0 assign a bandwidth to each queue is 

the second queue group QG(2) and the read cycle is trans- va ned widely. In the most basic method, packets are read out 

ferred to the third queue group QG(3) after performing from the queues in the order of descending priority and 

subtraction on the value of the counter CNT(2). By repeating subsequent priority queue is accessed to read out packets 

the read operations stated above, a series of packets are sent wnen tne higher priority queue becomes empty. Instead of 

out to the output port OUT sequentially, in the order of PI, ^ lhis method, it is also possible to assign a bandwidth to each 

P2, . . . , P14. Thus, although the contracted bandwidth is queue according to the order of priority beforehand, then 

exceeded temporarily, as just like outputting of packets P4 g* ve a different read-out time to each queue just like the 

and P5 from the queue group QG(2), the contracted band- method used by the read-out circuit 81. In this case, the 

widths of all the queue groups are guaranteed for packet controlling procedure is modified so that the queue group 

switching in a long-range view. 45 QG(p) is replaced with the queue Q(p) in the flow chart 

According to the controlling of packet read-out described shown in FIG. 8. 

above, if it is detected that there is no transmission packet in i n order to simplify controlling of packet reading more, it 

any one of the queue groups, a chance to read out packets is also possible to assign the number of packets readable 

comes round to residual queue groups in a shorter cycle. continuously for each queue beforehand regardless of the 

Consequently, it becomes possible for residual queue groups 50 packet length so that packets are read out from the next 

to effectively use the bandwidth assigned for the empty queue when the assigned number of packets are read out 

queue group. In addition, if all the queue groups except one fr 0m one queue according to the order of descending pri- 

become empty, read cycles are given to the residual queue ority. It is also possible to read out packets one by one from 

group almost continuously. As a result, the queue group can eacn queue simply according to the order of descending 

use the bandwidth of the output line all to itself, thus 55 priority, if necessary, 

receiving a benefit of services over its contracted level. n , 

1 ttt^ c .u • ,u • a * * u According to the flaw of output packets from the output 

In FIG. 5, the j-th queue group is used to store such , ff OT , 6 , L . . . JL *. . . . ._ 0 i • 

. . 1 . «u . u . u« e buffer 83 to the output port OUT, the read-out circuit 82 is 

transmission packets that have an extra combination of a , . . . e j . ■ •* o* u 

-ijj j . . • . • . . , located in front of the read -out circuit 81. However, in a 

source network address and a destination network address ..<•._. j . . 1 j 

j 1 , ... , _ viewpoint of the read-out control procedure, a queue group 

other than that correspond to bandwidth guarantee contracts 60 . , \ .jc.i_.i_ _ •_ _i_ __. 

j • .i_ _ i_i en. w-i .1 • .« , is determined first by the read-out circuit 81. then the 

registered m the table 50. For the j-th queue group, assigned " _ . e . J ,. 4 . . " 

is a residual bandwidth BWj which is a residual bandwidth read -° m c / cm 82 "nwpondiog to the determtned queue 

of the output line left by the other contractors. He user of 8™P reads out packets from one of the queues .n the group 

*u • i * i . . ... v,-- r , . , . . , .... as shown in the flow chart in FIG. 8. 
the residual bandwidth BWj shares the residual bandwidth 

with other non-contract users, so the user can receive a 65 In the packet switch shown in FIG. 1, a read-out circuit 82 

benefit of less-cost communications, although the commu- is disposed for each queue group in order to make it easier 

nication service quality is not so good. to understand the present invention. In this case, while 
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packets are read out from one queue group and passed to the 
next stage read-out circuit 81, none of the read-out circuits 
82 in other queue groups transmits packets. Consequently, 
all the packets stored in the output buffer 83 may be read out 
by one read-out circuit 82 by switching over the queue 
groups in turn, in an actual application. 

In addition, if an address table for indicating the addresses 
of the first packets of the respective queues is prepared for 
each queue group and if the address table is referred in 
accordance with the queue group QG(p) specified in step 
110 in FIG. 8 to read out packets from a queue identified by 
the priority, the functions of both read-out circuits 81 and 82 
can be realized by a processor or a dedicated hardware 
circuit. 

FIG. 10 indicates the packet read-out circuit 82 in another 
embodiment. 

This packet read-out circuit 82 comprises a complete 
priority type controlling circuit 821 for controlling the read 
out of packets so that all the packets are read out from the 
first priority queue, then from the next and subsequent 
priority queues sequentially, a rate based scheduler type of 
controlling circuit 822 for controlling the reading out of 
packets so that packets are read out from a plurality of 
queues in a queue group according to each assigned 
bandwidth, and a selector circuit 823 for selecting any one 
of the two controlling circuits according to a control signal 
from the management unit 9. One of the controlling circuits 
821 and 822 is activated according to the control signal from 
the management unit 9. According to this embodiment, the 
read-out mode of the transmission packet in the packet 
switch 1 can be switched over in a moment by a command 
supplied from a management terminal operated by the 
network manager. 

In the above embodiment, description has been made for 
packet switching in the network layer of the OSI reference 
model represented by IP packets, but the technical concept 
of the present invention may also apply to the data link layer 
of the OSI reference model, thereby controlling the band- 
width for each of the queue groups described above, for 
example, in an ETHERNET LAN switch. 

In the embodiment described with reference to FIG. 5, the 
bandwidth of each queue group is controlled under each 
contract identified from both source and destination network 
addresses. The present invention may also apply to another 
embodiment in which the transmission priority control unit 
5 classifies transmission packets by protocol, for example, 
TCP and other protocols into two groups, and transmission 
packets are queued into a plurality of queues for each queue 
group according to the packet priority included in the header 50 
information of each packet. In this case, bandwidths may be 
assigned to queue groups in such a manner that the whole 
bandwidth of an output line is assigned only to TCP packet 
flows and the residual bandwidth is assigned to the packet 
flows of other protocols. 55 

As to be understood from the above description, accord- 
ing to the present invention, it is possible to provide a packet 
switching in which the packet transmission priority is 
changed for each network connected to the node apparatus 
(packet switch) in a communication network such as the 60 
Internet in which a plurality of networks are connected 
through a common line. In addition, according to the present 
invention, it is also possible to provide a packet switch and 
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What is claimed is: 

1. A packet switch, comprising: 

a plurality of input port interfaces; 
a plurality of output port interfaces, and a packet relaying 
unit for switching variable length packets received 
from said input port interfaces to one of said output port 
interfaces corresponding to the destination address of 
each of said packets, 
wherein each of said output port interfaces comprises: 
a buffer memory for storing transmission packets, 
a transmission priority controller for classifying trans- 
mission packets, passed from said packet relaying 
unit into a plurality of queue groups to which specific 
bandwidths are assigned respectively, according to 
the header information of each of said transmission 
packets and for queuing said transmission packets in 
said buffer memory so as to form a plurality of 
queues according to transmission priority in each 
queue group, 

a packet read-out controller for reading out transmis- 
sion packets according to the transmission priority 
while guaranteeing the bandwidth assigned to each 
of said queue groups, and 

a packet transmission circuit for transmitting said trans- 
mission packets read out by said packet read -out 
controller to an output port associated with the 
output port interface, 

wherein said transmission priority controller includes a 
first management table for defining an identifier for 
each of said queue groups corresponding to a com- 
bination of a source network address and a destina- 
tion network address and a second management table 
for defining the transmission priority corresponding 
to a combination of a communication protocol and a 
packet priority, and 

wherein said transmission priority controller identifies 
a queue group corresponding to each of said trans- 
mission packets by referring to said first manage- 
ment table based on the source and destination 
network addresses included in the header of the 
transmission packet and identifies the transmission 
priority corresponding to the transmission packet by 
referring to said second management table based on 
the, communication protocol information and packet 
priority information included in the header of the 
transmission packet. 

2. A packet switch according to claim 1, wherein said 
packet read-out controller comprises: 

a management table for defining a threshold value in 
proportion to said bandwidth corresponding to each of 
said queue groups; and 

control means for accessing a plurality of said queue 
groups cyclically to read out transmission packets 
continuously from each of said queue groups until the 
total length of read out packets exceeds the threshold 
value defined in said management table, and for storing 
a surplus length of a packet read out over the threshold 
value as an initial value for counting the total length of 
packets in the next cycle to read out packets from the 
queue group. 

3. A packet switch according to claim 2, wherein said 
control means includes means for checking, before reading 
transmission packets from each of said queue groups, the 



a packet switching method that can change the transmission 

priority even with a communication protocol as needed 65 initial value for counting the total length of packets associ- 

while the communication bandwidth is guaranteed for each ated with each of said queue groups, subtracting the thresh- 

user according to the communication service contract. old value from the initial value if the initial value of one 



06/11/2004, EAST Version: 1.4.1 



US 6,570,876 Bl 



15 



16 



queue group exceeds the threshold value, and switching a 
queue group to be an object for reading out transmission 
packets therefrom, from said one queue group to a next 
queue group without reading transmission packets from said 
one queue group. 

4. A packet switch, comprising: 
a plurality of input port interfaces; 
a plurality of output port interfaces, and a packet relaying 
unit for switching variable length packets received 
from said input port interfaces to one of said output port 
interfaces corresponding to the destination address of 
each of said packets, 
wherein each of said output port interfaces comprises: 
a buffer memory for storing transmission packets, 
a transmission priority controller for classifying trans- 
mission packets, passed from said packet relaying 
unit into a plurality of queue groups to which specific 
bandwidths are assigned respectively, according to 
the header information of each of said transmission 
packets and for queuing said transmission packets in 
said buffer memory so as to form a plurality of 
queues according to transmission priority in each of 
said queue groups, 
a packet read-out controller for reading out transmis- 
sion packets according to the transmission priority 
while guaranteeing the bandwidth assigned to each 
of said queue groups, and 
a packet transmission circuit for transmitting said trans- 
mission packets read out by said packet read-out 
controller to an output port associated with the 
output port interface, 
wherein said transmission priority controller specifies, 
based on the protocol information included in the 
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header of each of transmission packets passed from 
said packet relaying unit, the queue group corre- 
sponding to the transmission packets, whereby each 
of said transmission packets is queued in one of the 
queues in the specified queue group. 

5. A packet switch according to claim 4, wherein said 
packet read-out controller comprises: 

a management table for defining a threshold value in 
proportion to said bandwidth corresponding to each of 
said queue groups; and 

control means for accessing a plurality of said queue 
groups cyclically to read out transmission packets 
continuously from each of said queue groups until the 
total length of read out packets exceeds the threshold 
value defined in said management table, and for storing 
a surplus length of a packet read out over the threshold 
value as an initial value for counting the total length of 
packets in the next cycle to read out packets from each 
of said queue groups. 

6. A packet switch according to claim 4, wherein said 
control means includes means for checking, before reading 
transmission packets from each of said queue groups, the 
initial value for counting the total length of packets associ- 
ated with each of said queue groups, subtracting the thresh- 
old value from the initial value if the initial value of one 
queue group exceeds the threshold value, and switching a 
queue group to be an object for reading out transmission 
packets therefrom, from said one queue group to a next 
queue of group without reading transmission packets from 
said one queue group. 
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